Description
Related Applications 

[0001] For U.S. National Stage purposes, this application is a continuation-in-part of U.S. application Serial No.
08/744,481, filed November 6, 1996, to Koster, entitled "DNA DIAGNOSTICS BASED ON MASS SPECTROMETRY".
This application is also a continuation-in-part of U.S. application Serial Nos. 08/744,590, 08/746,036, 08/746,055,
08/786,988, 08/787,639, 08/933,792 and U.S. application Serial No. atty dkt. no. 7352-2001 B, filed October 8, 1997,
which is a continuation-in-part of U.S. application Nos. 08/746,055, 08/786,988 and 08/787,639. For international
purposes, benefit of priority is claimed to each of these applications.

[0002] This application is related to U.S. Patent Application Serial No. 08/617,256, filed on March 18, 1996, which is
a continuation-in-part of U.S. application Serial No. 08/406,199, filed March 17, 1995, now U.S. Patent No. 5,605,798,
and is also related to U.S. Patent Nos. 5,547,835 and 5,622,824.

[0003] Where permitted, the subject matter of each of the above-noted patent applications and the patent is herein
incorporated in its entirety.

BACKGROUND OF THE INVENTION 

Detection of mutations 

[0004] The genetic information of all living organisms (e.g., animals, plants and microorganisms) is encoded in
deoxyribonucleic acid (DNA). In humans, the complete genome consists of about 100,000 genes located on 24
chromosomes (The Human Genome, T. Strachan, BIOS Scientific Publishers, 1992). Each gene codes for a specific protein,
which, after its expression via transcription and translation, fulfills a specific biochemical function within a living cell.
Changes in a DNA sequence are known as mutations and can result in proteins with altered or, in some cases, even
lost biochemical activities; this in turn can cause genetic disease. Mutations include nucleotide deletions, insertions or
alterations (i.e., point mutations). Point mutations can be either "missense", resulting in a change in the amino acid
sequence of a protein, or "nonsense", coding for a stop codon and thereby leading to a truncated protein.
[0005] More than 3000 genetic diseases are currently known (Human Genome Mutations, D. N. Cooper and M.
Krawczak, BIOS Publishers, 1993), including hemophilias, thalassemias, Duchenne Muscular Dystrophy (DMD),
Huntington's Disease (HD), Alzheimer's Disease and Cystic Fibrosis (CF). In addition to mutated genes, which result in
genetic disease, certain birth defects are the result of chromosomal abnormalities such as Trisomy 21 (Down's
Syndrome), Trisomy 13 (Patau Syndrome), Trisomy 18 (Edward's Syndrome), Monosomy X (Turner's Syndrome) and other
sex chromosome aneuploidies such as Klinefelter's Syndrome (XXY). Further, there is growing evidence that certain
DNA sequences may predispose an individual to any of a number of diseases such as diabetes, arteriosclerosis,
obesity, various autoimmune diseases and cancer (e.g., colorectal, breast, ovarian, lung).

[0006] Viruses, bacteria, fungi and other infectious organisms contain distinct nucleic acid sequences, which are 
different from the sequences contained in the host cell. Therefore, infectious organisms can also be detected and 
identified based on their specific DNA sequences. 

[0007] Since a sequence of about 16 nucleotides is specific on statistical grounds even for the size of the human
genome, relatively short nucleic acid sequences can be used to detect normal and defective genes in higher organisms
and to detect infectious microorganisms (e.g., bacteria, fungi, protists and yeast) and viruses. DNA sequences can
even serve as a fingerprint for detection of different individuals within the same species (see, Thompson, J. S. and M.
W. Thompson, eds., Genetics in Medicine, W.B. Saunders Co., Philadelphia, PA (1991)).
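
By way of illustration only, the statistical argument for a length of about 16 nucleotides can be sketched numerically. The sketch assumes equal and independent base frequencies, counts one strand only, and uses an approximate haploid human genome size of 3 x 10^9 base pairs; none of these figures is taken from the present description, and the function names are hypothetical.

```python
# Rough estimate of how often one specific k-mer is expected to occur by
# chance in a random sequence the size of the human genome.
# Assumptions (not from the description): equal base frequencies,
# independent positions, one strand, genome size of about 3e9 bases.

GENOME_SIZE = 3.0e9  # assumed approximate haploid genome size, in bases


def distinct_kmers(k: int) -> int:
    """Number of different sequences of length k over the alphabet A, C, G, T."""
    return 4 ** k


def expected_chance_hits(k: int, genome_size: float = GENOME_SIZE) -> float:
    """Expected number of chance occurrences of one given k-mer on one strand."""
    return genome_size / distinct_kmers(k)


for k in (12, 15, 16, 20):
    print(k, distinct_kmers(k), expected_chance_hits(k))
```

Under these assumptions the expected number of chance occurrences first drops below one at k = 16, which is consistent with the length stated above.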

[0008] Several methods for detecting DNA are currently being used. For example, nucleic acid sequences can be
identified by comparing the mobility of an amplified nucleic acid fragment with a known standard by gel electrophoresis,
or by hybridization with a probe, which is complementary to the sequence to be identified. Identification, however, can
only be accomplished if the nucleic acid fragment is labeled with a sensitive reporter function (e.g., radioactive
(32P, 35S), fluorescent or chemiluminescent). Radioactive labels can be hazardous and the signals they produce decay
over time. Non-isotopic labels (e.g., fluorescent) suffer from a lack of sensitivity and fading of the signal when high
intensity lasers are being used. Additionally, performing labeling, electrophoresis and subsequent detection are
laborious, time-consuming and error-prone procedures. Electrophoresis is particularly error-prone, since the size or the
molecular weight of the nucleic acid cannot be directly correlated to the mobility in the gel matrix. It is known that
sequence-specific effects, secondary structure and interactions with the gel matrix cause artefacts.


Use of mass spectrometry for detection and identification of nucleic acids

[0009] Mass spectrometry provides a means of "weighing" individual molecules by ionizing the molecules in vacuo




EP 1 164 203 A2 


and making them "fly" by volatilization. Under the influence of combinations of electric and magnetic fields, the ions
follow trajectories depending on their individual mass (m) and charge (z). In the range of molecules with low molecular
weight, mass spectrometry has long been part of the routine physical-organic repertoire for analysis and
characterization of organic molecules by the determination of the mass of the parent molecular ion. In addition, by arranging
collisions of this parent molecular ion with other particles (e.g., argon atoms), the molecular ion is fragmented, forming
secondary ions by the so-called collision induced dissociation (CID). The fragmentation pattern/pathway very often
allows the derivation of detailed structural information. Many applications of mass spectrometric methods are known
in the art, particularly in biosciences (see, e.g., Methods in Enzymol., Vol. 193: "Mass Spectrometry" (J. A. McCloskey,
editor), 1990, Academic Press, New York).

[0010] Because of the apparent analytical advantages of mass spectrometry in providing high detection sensitivity,
accuracy of mass measurements, detailed structural information by CID in conjunction with an MS/MS configuration
and speed, as well as on-line data transfer to a computer, there has been interest in the use of mass spectrometry for
the structural analysis of nucleic acids. Recent reviews summarizing this field include K.H. Schram, "Mass Spectrometry
of Nucleic Acid Components," Biomedical Applications of Mass Spectrometry 34, 203-287 (1990); and P.F. Crain, "Mass
Spectrometric Techniques in Nucleic Acid Research," Mass Spectrometry Reviews 9, 505-554 (1990); see also U.S.
Patent No. 5,547,835 and U.S. Patent No. 5,622,824.

[0011] Nucleic acids, however, are very polar biopolymers that are very difficult to volatilize. Consequently, mass
spectrometric detection has been limited to low molecular weight synthetic oligonucleotides for confirming an already
known oligonucleotide sequence by determining the mass of the parent molecular ion, or alternatively, confirming a
known sequence through the generation of secondary ions (fragment ions) via CID in an MS/MS configuration using,
in particular, for the ionization and volatilization, the method of fast atom bombardment (FAB mass spectrometry) or
plasma desorption (PD mass spectrometry). As an example, the application of FAB to the analysis of protected dimeric
blocks for chemical synthesis of oligodeoxynucleotides has been described (Koster et al. (1987) Biomed. Environ.
Mass Spectrometry 14, 111-116).

[0012] Other ionization/desorption techniques include electrospray/ion-spray (ES) and matrix-assisted laser
desorption/ionization (MALDI). ES mass spectrometry has been introduced by Fenn et al. (J. Phys. Chem. 88:4451-59 (1984);
PCT Application No. WO 90/14148) and current applications are summarized in review articles (see, e.g., Smith et al.
(1990) Anal. Chem. 62:882-89 and Ardrey (1992) Electrospray Mass Spectrometry, Spectroscopy Europe 4:10-18).
The molecular weights of a tetradecanucleotide (see, Covey et al. (1988) "The Determination of Protein, Oligonucleotide
and Peptide Molecular Weights by Ionspray Mass Spectrometry," Rapid Commun. in Mass Spectrometry 2:249-256),
and of a 21-mer (Methods in Enzymol., 193, "Mass Spectrometry" (McCloskey, editor), p. 425, 1990, Academic Press,
New York) have been published. As a mass analyzer, a quadrupole is most frequently used. Because of the presence
of multiple ion peaks that all could be used for the mass calculation, the determination of molecular weights in femtomole
amounts of sample is very accurate.
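
As a non-limiting sketch of how multiple ion peaks can each contribute to the mass calculation, the following assumes an idealized series of peaks from adjacent charge states of a single species, with charge arising only from gain (positive mode) or loss (negative mode) of protons. The peak values, the 6400 Da species and the function names are hypothetical and serve only to illustrate the arithmetic.

```python
# Charge-state deconvolution of an idealized electrospray spectrum.
# Assumption: peak position m/z = (M + polarity * n * PROTON) / n, where n is
# the number of charges and polarity is +1 (protons gained) or -1 (lost).

PROTON = 1.007276  # mass of a proton in Da


def mz(mass: float, charge: int, polarity: int = -1) -> float:
    """Position of the peak carrying `charge` charges for a species of `mass`."""
    return (mass + polarity * charge * PROTON) / charge


def deconvolute(peaks, polarity: int = -1) -> float:
    """Average molecular mass estimated from peaks of adjacent charge states."""
    peaks = sorted(peaks)
    estimates = []
    for p_low, p_high in zip(peaks, peaks[1:]):
        # p_high carries n charges, p_low carries n + 1 charges.
        n = round((p_low - polarity * PROTON) / (p_high - p_low))
        estimates.append(n * (p_high - polarity * PROTON))
    return sum(estimates) / len(estimates)


# Hypothetical 6400 Da oligonucleotide observed in negative-ion mode.
observed = [mz(6400.0, z) for z in (4, 5, 6, 7)]
print(deconvolute(observed))
```

Every adjacent pair of peaks yields an independent estimate of the same mass, and averaging these estimates is one simple way in which several peaks can improve the accuracy of the determination.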

[0013] MALDI mass spectrometry, in contrast, can be attractive when a time-of-flight (TOF) configuration (see,
Hillenkamp et al. (1990) pp 49-60 in "Matrix Assisted UV-Laser Desorption/Ionization: A New Approach to Mass
Spectrometry of Large Biomolecules," Biological Mass Spectrometry, Burlingame and McCloskey, editors, Elsevier Science
Publishers, Amsterdam) is used as a mass analyzer. Since, in most cases, no multiple molecular ion peaks are produced
with this technique, the mass spectra, in principle, look simpler compared to ES mass spectrometry.
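
For orientation only, the relationship between mass and flight time in an idealized linear time-of-flight analyzer can be sketched as follows. The sketch assumes that an ion of charge z accelerated through a potential U enters a field-free drift tube of length L with kinetic energy z·e·U, and it neglects the time spent in the acceleration region. The 20 kV potential and 1 m drift length are hypothetical values, not parameters taken from this description.

```python
import math

# Idealized linear time-of-flight relation: z * e * U = (1/2) * m * v**2,
# so the drift time is t = L * sqrt(m / (2 * z * e * U)).
# Instrument values below are illustrative assumptions only.

DALTON_KG = 1.66053907e-27   # kilograms per dalton
E_CHARGE = 1.602176634e-19   # elementary charge in coulombs


def flight_time(mass_da, charge=1, voltage=20_000.0, length_m=1.0):
    """Drift time in seconds for an ion of the given mass (Da) and charge."""
    velocity = math.sqrt(2.0 * charge * E_CHARGE * voltage / (mass_da * DALTON_KG))
    return length_m / velocity


def mass_from_time(time_s, charge=1, voltage=20_000.0, length_m=1.0):
    """Inverse relation: mass in Da recovered from a measured drift time."""
    return 2.0 * charge * E_CHARGE * voltage * (time_s / length_m) ** 2 / DALTON_KG


for mass in (3000.0, 6000.0, 12000.0):
    print(f"{mass:8.0f} Da -> {flight_time(mass) * 1e6:6.2f} microseconds")
```

Because the flight time grows with the square root of m/z, a spectrum dominated by singly charged molecular ions maps each species to a single arrival time, which is one reason such spectra look comparatively simple.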

[0014] Although DNA molecules up to a molecular weight of 410,000 daltons have been desorbed and volatilized
(Williams et al., "Volatilization of High Molecular Weight DNA by Pulsed Laser Ablation of Frozen Aqueous Solutions,"
Science 246, 1585-87 (1989)), this technique had only shown very low resolution (oligothymidylic acids up to 18
nucleotides, Huth-Fehre et al. Rapid Commun. in Mass Spectrom., 6, 209-13 (1992); DNA fragments up to 500 nucleotides
in length, K. Tang et al. Rapid Commun. in Mass Spectrom., 8, 727-730 (1994); and a double-stranded DNA of 28 base
pairs (Williams et al., "Time-of-Flight Mass Spectrometry of Nucleic Acids by Laser Ablation and Ionization from a Frozen
Aqueous Matrix," Rapid Commun. in Mass Spectrom., 4, 348-351 (1990))). Japanese Patent No. 59-131909 describes
an instrument, which detects nucleic acid fragments separated either by electrophoresis, liquid chromatography or high
speed gel filtration. Mass spectrometric detection is achieved by incorporating into the nucleic acids atoms, such as S,
Br, I or Ag, Au, Pt, Os, Hg, that normally do not occur in DNA.

[0015] Co-owned U.S. Patent No. 5,622,824 describes methods for DNA sequencing based on mass spectrometric
detection. To achieve this, the DNA is, by means of protection, specificity of enzymatic activity, or immobilization,
unilaterally degraded in a stepwise manner via exonuclease digestion, and the nucleotides or derivatives are detected by mass
spectrometry. Prior to the enzymatic degradation, sets of ordered deletions that span a cloned DNA fragment can be
created. In this manner, mass-modified nucleotides can be incorporated using a combination of exonuclease and DNA/
RNA polymerase. This permits either multiplex mass spectrometric detection, or modulation of the activity of the
exonuclease so as to synchronize the degradative process. Co-owned U.S. Patent Nos. 5,605,798 and 5,547,835 provide
methods for detecting a particular nucleic acid sequence in a biological sample. Depending on the sequence to be
detected, the processes can be used, for example, in methods of diagnosis. These methods, while broadly useful and
applicable to numerous embodiments, represent the first disclosure of such applications and can be improved upon.
[0016] Therefore, it is an object herein to provide improved methods for sequencing and detecting DNA molecules
in biological samples. It is also an object herein to provide improved methods for diagnosis of genetic diseases,
predispositions to certain diseases, cancers, and infections.


SUMMARY OF THE INVENTION 

[0017] Methods of diagnosis by detecting and/or determining sequences of nucleic acids that are based on mass
spectrometry are provided herein. Methods are provided for detecting double-stranded DNA, detecting mutations and
other diagnostic markers using MS analysis. In particular, methods for diagnosing neuroblastoma, detecting heredity
relationships and HLA compatibility, genetic fingerprinting, and detecting telomerase activity for cancer diagnosis are provided.
[0018] In certain embodiments the DNA is immobilized on a solid support either directly or via a linker and/or bead.
Three permutations of the methods for DNA detection in which immobilized DNA is used are exemplified. These include:
(1) immobilization of a template; hybridization of the primer; extension of the primer, or extension of the primer (single
ddNTP) for sequencing or diagnostics or extension of the primer and endonuclease degradation (sequencing); (2)
immobilization of a primer; hybridization of a single stranded template; and extension of the primer, or extension of the
primer (single ddNTP) for sequencing or diagnostics or extension of the primer and endonuclease degradation
(sequencing); (3) immobilization of the primer; hybridization of a double stranded template; extension of the primer, or
extension of the primer (single ddNTP) for sequencing or diagnostics or extension of the primer and endonuclease
degradation (sequencing).

[0019] In certain embodiments the DNA is immobilized on the support via a selectively cleavable linker. Selectively
cleavable linkers include, but are not limited to, photocleavable linkers, chemically cleavable linkers and enzymatically
cleavable linkers (such as a restriction site (nucleic acid linker) or a protease site). Inclusion of a selectively cleavable
linker expands the capabilities of the MALDI-TOF MS analysis because it allows for all of the permutations of
immobilization of DNA for MALDI-TOF MS, the DNA linkage to the support through the 3'- or 5'-end of a nucleic acid; allows
the amplified DNA or the target primer to be extended by DNA synthesis; and further allows for the mass of the extended
product (or degraded product via exonuclease degradation) to be of a size that is appropriate for MALDI-TOF MS
analysis (i.e., the isolated or synthesized DNA can be large and a small primer or a large primer sequence can be used
and a small restriction fragment of a gene or single strand thereof hybridized thereto).

[0020] In a preferred embodiment, the selectively cleavable linker is a chemical or photocleavable linker that is
cleaved during the ionizing step of mass spectrometry. Exemplary linkers include linkers containing a disulfide group,
a levulinyl group, an acid-labile trityl group and a hydrophobic trityl group. In other embodiments, the enzymatically
cleavable linker can be a nucleic acid that is an RNA nucleotide or that encodes a restriction endonuclease site. Other
enzymatically cleavable linkers include linkers that contain a pyrophosphate group, an arginine-arginine group and a
lysine-lysine group. Other linkers are exemplified herein.

[0021] Methods for sequencing long fragments of DNA are provided. To perform such sequencing, specific base
terminated fragments are generated from a target nucleic acid. The analysis of fragments rather than the full length
nucleic acid shifts the mass of the ions to be determined into a lower mass range, which is generally more amenable
to mass spectrometric detection. For example, the shift to smaller masses increases mass resolution, mass accuracy
and, in particular, the sensitivity for detection. Hybridization events and the actual molecular weights of the fragments
as determined by mass spectrometry provide sequence information (e.g., the presence and/or identity of a mutation).
In a preferred embodiment, the fragments are captured on a solid support prior to hybridization and/or mass
spectrometry detection. In another preferred embodiment, the fragments generated are ordered to provide the sequence of the
larger nucleic acid.

[0022] One preferred method for generating base specifically terminated fragments from a nucleic acid is effected
by contacting an appropriate amount of a target nucleic acid with an appropriate amount of a specific endonuclease,
thereby resulting in partial or complete digestion of the target nucleic acid. Endonucleases will typically degrade a
sequence into pieces of no more than about 50-70 nucleotides, even if the reaction is not run to full completion. In a
preferred embodiment, the nucleic acid is a ribonucleic acid and the endonuclease is a ribonuclease (RNase) selected
from among: the G-specific RNase T1, the A-specific RNase U2, the A/U specific RNase PhyM, U/C specific RNase
A, C specific chicken liver RNase (RNase CL3) or crisavitin. In another preferred embodiment, the endonuclease is a
restriction enzyme that cleaves at least one site contained within the target nucleic acid. Another preferred method for
generating base specifically terminated fragments includes performing a combined amplification and base-specific
termination reaction (e.g., using an appropriate amount of a first DNA polymerase, which has a relatively low affinity
towards the chain-terminating nucleotides resulting in an exponential amplification of the target; and a polymerase with
a relatively high affinity for the chain terminating nucleotide resulting in base-specific termination of the polymerization).
Inclusion of a tag at the 5' and/or 3' end of a target nucleic acid can facilitate the ordering of fragments.
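
The base-specific digestion described above can be illustrated with a small in silico sketch. It assumes complete digestion and assumes that each enzyme cleaves on the 3' side of every base it recognizes; the base specificities are those listed above, while the example sequence and the names `SPECIFICITY` and `digest` are hypothetical.

```python
# In silico complete digestion of an RNA with a base-specific RNase.
# Assumption: cleavage occurs immediately 3' of each recognized base, so every
# fragment except possibly the last ends in a recognized base.

SPECIFICITY = {
    "T1": "G",     # G-specific
    "U2": "A",     # A-specific
    "PhyM": "AU",  # A/U-specific
    "A": "UC",     # U/C-specific
    "CL3": "C",    # C-specific
}


def digest(rna: str, enzyme: str) -> list:
    """Return the fragments, in 5'-to-3' order, of a complete digestion."""
    recognized = SPECIFICITY[enzyme]
    fragments, start = [], 0
    for position, base in enumerate(rna):
        if base in recognized:
            fragments.append(rna[start:position + 1])
            start = position + 1
    if start < len(rna):
        fragments.append(rna[start:])  # 3'-terminal remainder
    return fragments


print(digest("AUGCCGUAAGCU", "T1"))  # ['AUG', 'CCG', 'UAAG', 'CU']
print(digest("AUGCCGUAAGCU", "U2"))  # ['A', 'UGCCGUA', 'A', 'GCU']
```

A partial digestion would additionally yield fragments spanning one or more uncleaved sites; such overlapping fragments are what allows the pieces to be ordered.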
[0023] Methods for determining the sequence of an unknown nucleic acid in which the 5' and/or 3' end of the target
nucleic acid can include a tag are provided. Inclusion of a non-natural tag on the 3' end is also useful for ruling out or
compensating for the influence of 3' heterogeneity, premature termination and nonspecific elongation. In a preferred
embodiment, the tag is an affinity tag (e.g., biotin or a nucleic acid that hybridizes to a capture nucleic acid). Most
preferably the affinity tag facilitates binding of the nucleic acid to a solid support. In another preferred embodiment, the
tag is a mass marker (i.e., a marker of a mass that does not correspond to the mass of any of the four nucleotides). In
a further embodiment, the tag is a natural tag, such as a polyA tail or the natural 3' heterogeneity that can result, for
example, from a transcription reaction.

[0024] Methods of sequence analysis in which nucleic acids that have been replicated from a nucleic acid molecule
obtained from a biological sample are specifically digested using one or more nucleases (deoxyribonucleases for DNA,
and ribonucleases for RNA) are provided. The fragments are captured on a solid support carrying the corresponding
complementary sequences. Hybridization events and the actual molecular weights of the captured target sequences provide
information on mutations in the gene. The array can be analyzed spot-by-spot using mass spectrometry. Further, the
fragments generated can be ordered to provide the sequence of the larger target fragment.

[0025] In another embodiment, at least one primer with a 3'-terminal base is hybridized to the target nucleic acid 
near a site where possible mutations are to be detected. An appropriate polymerase and a set of three nucleoside 
triphosphates (NTPs) and the fourth added as a terminator are reacted. The extension reaction products are measured 
by mass spectrometry and are indicative of the presence and the nature of a mutation. The set of three NTPs and one 
dd-NTP (or three NTPs and one 3'-deoxy NTP), will be varied to be able to discriminate between several mutations 
(including compound heterozygotes) in the target nucleic acid sequence. 
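
The extension reaction described in this paragraph can be sketched as follows, as an illustration under stated assumptions rather than as the method itself. The primer, the downstream sequences and the choice of ddT as terminator are hypothetical; the masses are commonly used average residue masses for unmodified DNA with 5'-OH and 3'-OH ends, and a 3'-terminal dideoxynucleotide is taken to be about 16.00 Da lighter because it lacks the 3'-hydroxyl oxygen.

```python
# Primer extension with three dNTPs and one ddNTP, followed by a mass estimate.
# All sequences are hypothetical and are written 5' to 3' for the strand being
# synthesized (i.e., the complement of the template).

DNMP = {"A": 313.21, "C": 289.18, "G": 329.21, "T": 304.20}  # average Da per residue
END_CORRECTION = -61.96  # converts residue sum to a 5'-OH / 3'-OH oligonucleotide
DIDEOXY = -16.00         # missing 3'-OH oxygen on a terminating ddNMP


def extend(primer: str, downstream: str, terminator: str):
    """Extend until the first terminator base is incorporated (inclusive)."""
    added = ""
    for base in downstream:
        added += base
        if base == terminator:
            return primer + added, True
    return primer + added, False  # ran off the template without terminating


def mass(sequence: str, dideoxy_end: bool = False) -> float:
    """Approximate average molecular mass of a single-stranded product."""
    total = sum(DNMP[base] for base in sequence) + END_CORRECTION
    return total + DIDEOXY if dideoxy_end else total


primer = "GATTACAGATTACA"
wild_type, wt_stop = extend(primer, "CGTAAC", "T")  # terminates after +3
mutant, mut_stop = extend(primer, "TGTAAC", "T")    # terminates after +1

print(wild_type, round(mass(wild_type, wt_stop), 2))
print(mutant, round(mass(mutant, mut_stop), 2))
```

In this hypothetical case the two alleles give products that differ by two nucleotides (about 618 Da), which is far easier to resolve than the mass difference of a single base substitution in products of equal length. A heterozygous sample would show both product masses.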

[0026] Methods for detecting and diagnosing neoplasia/malignancies in a tissue or cell sample are provided. The
methods rely on a telomeric repeat amplification protocol (TRAP)-MS assay and include the steps of:

a) obtaining a tissue or a cell sample, such as a clinical isolate or culture of suspected cells; 

b) isolating/extracting/purifying telomerase from the sample; 

c) adding the telomerase extract to a composition containing a synthetic DNA primer, which is optionally
immobilized, complementary to the telomeric repeat, and all four dNTPs under conditions that result in telomerase specific
extension of the synthetic DNA;

d) amplifying the telomerase extended DNA products, preferably using a primer that contains a "linker moiety",
such as a moiety based on thiol chemistry or streptavidin;

e) isolating linker-amplified primers, such as by using a complementary binding partner immobilized on a solid 
support; 

f) optionally conditioning the DNA for crystal formation; and 

g) performing MS by ionizing/volatizing the sample to detect the DNA product. 

Telomerase-specific extension is indicative of neoplasia/malignancy.

This method can be used to detect specific malignancies. The use of MS to detect the DNA product permits
identification of the extended product, which is indicative of telomerase activity in the sample. If desired, the synthetic DNA
can be in the form of an array.

[0027] Methods for detecting mutations in oncogenes, and the use thereof to screen for transformed cells, which are
indicative of neoplasia, are provided. Detection of mutations present in oncogenes is indicative of
transformation. This method includes the steps of:

a) obtaining a biological sample; 

b) amplifying a portion of the selected proto-oncogene that includes a codon indicative of transformation, where 
one primer has a linker moiety for immobilization; 

c) immobilizing DNA via the linker moiety to a solid support, optionally in the form of an array; 

d) hybridizing a primer complementary to the proto-oncogene sequence that is upstream from the codon;

e) adding 3 dNTPs/1 ddNTP and DNA polymerase and extending the hybridized primer to the next ddNTP location;

f) ionizing/volatizing the sample; and 

g) detecting the mass of the extended DNA, whereby mass indicates the presence of wild-type or mutant alleles. 
The presence of a mutant allele at the codon is diagnostic for neoplasia. 

In an exemplary embodiment, extension-MS analysis is used to detect the presence of a mutated codon 634 in the
retrovirus (RET)-proto oncogene.

[0028] In another embodiment, methods for diagnosing diseases using reverse transcription and amplification of a
gene expressed in transformed cells are provided. In particular, a method for diagnosis of neuroblastoma using reverse transcriptase
(RT)-MS of tyrosine hydroxylase, which is a catecholamine biosynthetic enzyme that is expressed in tumor cells, but not
in normal cells, such as normal bone marrow cells, is provided. The method includes the steps of:
a) obtaining a tissue sample;

b) isolating polyA RNA from the sample;

c) preparing a cDNA library using reverse transcription;

d) amplifying a cDNA product, or portion thereof, of the selected gene, where one oligo primer has a linker moiety;

e) isolating the amplified product by immobilizing the DNA to a solid support via the linker moiety;

f) optionally conditioning the DNA;

g) ionizing/volatizing the sample and detecting the presence of a DNA peak that is indicative of expression of the
selected gene. For example, expression of the tyrosine hydroxylase gene is indicative of neuroblastoma.

[0029] Also provided are methods of directly detecting a double-stranded nucleic acid using MALDI-TOF MS. These
methods include the steps of: 

a) isolating a double stranded DNA of an appropriate size for MS via amplification methods or formed by
hybridization of single-stranded DNA fragments;

b) preparing the double-stranded DNA for analysis under conditions that increase the ratio of dsDNA:ssDNA in
which the conditions include one or all of the following: preparing samples for analysis at reduced temperatures
(i.e., 4°C), and using higher DNA concentrations in the matrix to drive duplex formation;

c) ionizing/volatizing the sample of step b), where this step uses low acceleration voltage of the ions to assist in
maintaining duplex DNA by, for example, adjusting laser power to just above threshold irradiation for ionization; and

d) detecting the presence of the dsDNA of the appropriate mass.

In preferred embodiments, the matrix includes 3-hydroxypicolinic acid. The detected DNA can be indicative of a genetic
disorder, genetic disease, genetic predisposition to a disease or chromosomal abnormalities. In other embodiments, the
mass of the double stranded DNA is indicative of a deletion, insertion or mutation.
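
As a hedged illustration of what "the appropriate mass" in step d) may mean in practice, the expected mass of an intact duplex can be approximated as the sum of the masses of its two complementary strands. The sketch assumes unmodified strands with 5'-OH and 3'-OH ends, commonly used average residue masses, and no counter-ion or matrix adducts; the sequence and function names are hypothetical.

```python
# Approximate expected mass of a blunt-ended DNA duplex as the sum of the
# average masses of the two complementary single strands.

DNMP = {"A": 313.21, "C": 289.18, "G": 329.21, "T": 304.20}  # average Da per residue
END_CORRECTION = -61.96  # 5'-OH / 3'-OH ends, no terminal phosphate
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(sequence: str) -> str:
    """Complementary strand, written 5' to 3'."""
    return sequence.translate(COMPLEMENT)[::-1]


def strand_mass(sequence: str) -> float:
    return sum(DNMP[base] for base in sequence) + END_CORRECTION


def duplex_mass(sequence: str) -> float:
    return strand_mass(sequence) + strand_mass(reverse_complement(sequence))


top = "GATTACA"  # hypothetical top strand
print(reverse_complement(top), round(strand_mass(top), 2), round(duplex_mass(top), 2))
```

A signal near the duplex mass, rather than near either single-strand mass, would then be the quantity compared against the expected value when looking for a deletion, insertion or mutation.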

[0030] A method designated primer oligo base extension (PROBE) is provided. This method uses a single detection
primer followed by an oligonucleotide extension step to give products, which can be readily resolved by MALDI-TOF
mass spectrometry. The products differ in length by a number of bases specific for a number of repeat units or for
second site mutations within the repeated region. The method is exemplified using as a model system the AluVpA
polymorphism in intron 5 of the interferon-α receptor gene located on human chromosome 21, and the poly T tract of
the splice acceptor site of intron 8 from the CFTR gene located on human chromosome 7. The method is
advantageously used, for example, for determining identity, identifying mutations, familial relationship, HLA compatibility and
other such markers, using PROBE-MS analysis of microsatellite DNA. In a preferred embodiment, the method includes
the steps of:

a) obtaining a biological sample from two individuals;

b) amplifying a region of DNA from each individual that contains two or more microsatellite DNA repeat sequences;

c) ionizing/volatizing the amplified DNA;

d) detecting the presence of the amplified DNA and comparing the molecular weight of the amplified DNA. Different
sizes are indicative of non-identity (i.e., wild-type versus mutation), non-heredity or non-compatibility; similar size
fragments indicate the possibility of identity, of familial relationship, or of HLA compatibility.
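
The comparison in step d) can be sketched as follows, purely by way of example. The sketch assumes that each individual is represented by a list of measured product masses for one marker, that a mass difference between alleles arises from whole repeat units, and that a fixed tolerance in daltons decides whether two masses are "similar"; the tolerance, the CA repeat and the mass values are hypothetical.

```python
# Comparing microsatellite product masses from two individuals.
# Average residue masses are commonly used values for unmodified DNA.

DNMP = {"A": 313.21, "C": 289.18, "G": 329.21, "T": 304.20}


def unit_mass(repeat: str) -> float:
    """Mass contributed by one additional copy of the repeat unit."""
    return sum(DNMP[base] for base in repeat)


def repeat_difference(mass_a: float, mass_b: float, repeat: str) -> int:
    """Nearest whole number of repeat units separating two product masses."""
    return round((mass_a - mass_b) / unit_mass(repeat))


def profiles_match(masses_a, masses_b, tolerance: float = 5.0) -> bool:
    """True if both individuals show the same number of products and each
    pair of sorted masses agrees within the assumed tolerance (in Da)."""
    a, b = sorted(masses_a), sorted(masses_b)
    return len(a) == len(b) and all(abs(x - y) <= tolerance for x, y in zip(a, b))


individual_1 = [7500.0, 8102.4]
individual_2 = [8101.0, 7501.2]
print(profiles_match(individual_1, individual_2))
print(repeat_difference(7500.0 + 3 * unit_mass("CA"), 7500.0, "CA"))
```

Matching masses only indicate the possibility of identity or relationship for that marker, so in practice several markers would be compared before drawing a conclusion.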

[0031] More than one marker may be examined simultaneously; primers with different linker moieties are used for
immobilization.

[0032] Another method, loop-primer oligo base extension, designated LOOP-PROBE, for detection of mutations,
especially predominant disease-causing mutations or common polymorphisms, is provided. In a particular embodiment,
this method for detecting target nucleic acid in a sample includes the steps of:

a) amplifying a target nucleic acid sequence, such as β-globin, in a sample, using (i) a first primer whose 5'-end
shares identity to a portion of the target DNA immediately downstream from the targeted codon followed by a
sequence that introduces a unique restriction endonuclease site, such as CfoI in the case of β-globin, into the
amplicon and whose 3'-end primer is self-complementary; and (ii) a second downstream primer that contains a
tag, such as biotin, for immobilizing the DNA to a solid support, such as streptavidin beads;

c) immobilizing the double-stranded amplified DNA to a solid support via the linker moiety;

d) denaturing the immobilized DNA and isolating the non-immobilized DNA strand;

e) annealing the intracomplementary sequences in the 3'-end of the isolated non-immobilized DNA strand, such
that the 3'-end is extendable by a polymerase, which annealing can be performed, for example, by heating and
then cooling to about 37°C, or other suitable method;

f) extending the annealed DNA by adding DNA polymerase, 3 dNTPs/1 ddNTP, whereby the 3'-end of the DNA
strand is extended by the DNA polymerase to the position of the next ddNTP location (i.e., to the mutation location);

g) cleaving the extended double stranded stem loop DNA with the unique restriction endonuclease and removing
the cleaved stem loop DNA;

i) (optionally adding a matrix) ionizing/volatizing the extended product; and

j) detecting the presence of the extended target nucleic acid, whereby the presence of a DNA fragment of a mass
different from wild-type is indicative of a mutation at the target codon(s).

This method eliminates one specific reagent for mutation detection compared to other methods of MS mutational analyses,
thereby simplifying the process and rendering it amenable to automation. Also, the specific extended product that is
analyzed is cleaved from the primer and is therefore shorter compared to the other methods. In addition, the annealing
efficiency is higher compared to annealing of an added primer and should therefore generate more product. The process
is compatible with multiplexing and various detection schemes (e.g., single base extension, oligo base extension and
sequencing). For example, the extension of the loop-primer can be used for generation of short diagnostic sequencing
ladders within highly polymorphic regions to perform, for example, HLA typing or resistance as well as species typing.

[0033] In another embodiment, a method of detecting a target nucleic acid in a biological sample using RNA
amplification is provided. In the method, the target nucleic acid is amplified using a primer that shares a region
complementary to the target sequence and upstream encodes a promoter, such as the T7 promoter. A DNA-dependent
RNA polymerase and appropriate ribonucleotides are added to synthesize RNA, which is analyzed by MS.
[0034] Improved methods of sequencing DNA using MS are provided. In these methods thermocycling for
amplification is used prior to MS analysis, thereby increasing the signal.

[0035] Also provided are primers for use in MS analyses. In particular, primers comprising all or, for longer
oligonucleotides, at least about 20, preferably about 16, bases of any of the sequences of nucleotides set forth in
SEQ ID NOs. 1-22, 24, 27-38, 41-86, 89, 92, 95, 98, 101-110, 112-123, 126, 128, 129, and primers set forth in SEQ
ID Nos. 280-287, are provided. The primers are unlabeled, and optionally include a mass modifying moiety, which is preferably
attached to the 5'-end.

[0036] Other features and advantages of the methods provided herein will be further described with reference to the 
following Figures, Detailed Description and Claims. 

BRIEF DESCRIPTION OF THE FIGURES 


[0037] FIGURE 1A is a diagram showing a process for performing mass spectrometric analysis on one target
detection site (TDS) contained within a target nucleic acid molecule (T), which has been obtained from a biological sample.
A specific capture sequence (C) is attached to a solid support (SS) via a spacer (S). The capture sequence is chosen
to specifically hybridize with a complementary sequence on the target nucleic acid molecule (T), known as the target
capture site (TCS). The spacer (S) facilitates unhindered hybridization. A detector nucleic acid sequence (D), which
is complementary to the TDS, is then contacted with the TDS. Hybridization between D and the TDS can be detected
by mass spectrometry.

[0038] FIGURE 1B is a diagram showing a process for performing mass spectrometric analysis on at least one target
detection site (here TDS 1 and TDS 2) via direct linkage to a solid support. The target sequence (T) containing the
target detection site (TDS 1 and TDS 2) is immobilized to a solid support via the formation of a reversible or irreversible
bond formed between an appropriate functionality (L') on the target nucleic acid molecule (T) and an appropriate
functionality (L) on the solid support. Detector nucleic acid sequences (here D1 and D2), which are complementary to a
target detection site (TDS 1 or TDS 2), are then contacted with the TDS. Hybridization between TDS 1 and D1 and/or
TDS 2 and D2 can be detected and distinguished based on molecular weight differences.

[0039] FIGURE 1C is a diagram showing a process for detecting a wildtype (D^wt) and/or a mutant (D^mut) sequence
in a target (T) nucleic acid molecule. As in Figure 1A, a specific capture sequence (C) is attached to a solid support
(SS) via a spacer (S). In addition, the capture sequence is chosen to specifically interact with a complementary
sequence on the target sequence (T), the target capture site (TCS) to be detected through hybridization. If the target
detection site (TDS) includes a mutation, X, which changes the molecular weight, mutated target detection sites can be
distinguished from wildtype by mass spectrometry. Preferably, the detector nucleic acid molecule (D) is designed so that
the mutation is in the middle of the molecule and therefore would not lead to a stable hybrid if the wildtype detector
oligonucleotide (D^wt) is contacted with the target detector sequence, e.g., as a control. The mutation can also be
detected if the mutated detector oligonucleotide (D^mut) with the matching base at the mutated position is used for
hybridization. If a nucleic acid molecule obtained from a biological sample is heterozygous for the particular sequence
(i.e., contains both D^wt and D^mut), both D^wt and D^mut will be bound to the appropriate strand and the mass
difference allows both D^wt and D^mut to be detected simultaneously.

[0040] FIGURE 2 is a diagram showing a process in which several mutations are simultaneously detected on one
target sequence. The molecular weight differences between the detector oligonucleotides D1, D2 and D3 must be large
enough so that simultaneous detection (multiplexing) is possible. This can be achieved either by the sequence itself
(composition or length) or by the introduction of mass-modifying functionalities M1-M3 into the detector oligonucleotide.
[0041] FIGURE 3 is a diagram showing still another multiplex detection format. In this embodiment, differentiation
is accomplished by employing different specific capture sequences which are position-specifically immobilized on a
flat surface (e.g., a "chip array"). If different target sequences T1-Tn are present, their target capture sites TCS1-TCSn
will interact with complementary immobilized capture sequences C1-Cn. Detection is achieved by employing
appropriately mass differentiated detector oligonucleotides D1-Dn, which are mass differentiated either by their sequences
or by mass modifying functionalities M1-Mn.
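
By way of non-limiting illustration, the requirement that multiplexed detector oligonucleotides be sufficiently mass differentiated can be checked numerically. The following Python sketch is not part of the described method. It assumes unmodified DNA detectors with free 5'- and 3'-hydroxyl groups and uses standard average residue masses; the function names, the example sequences, the 50 Da modifier and the 30 Da threshold are hypothetical values chosen only for illustration.

```python
from itertools import combinations

# Standard average masses (Da) of nucleotide residues within a DNA chain.
RESIDUE_MASS = {"A": 313.21, "C": 289.18, "G": 329.21, "T": 304.20}


def oligo_mass(seq, modifier_mass=0.0):
    """Approximate average mass of a linear DNA oligonucleotide with free
    5'- and 3'-hydroxyl groups, plus an optional mass-modifying moiety."""
    return sum(RESIDUE_MASS[base] for base in seq.upper()) - 61.96 + modifier_mass


def can_multiplex(masses, min_delta):
    """Return True if every pair of detector masses differs by at least
    min_delta (a hypothetical resolution threshold, in Da)."""
    return all(abs(a - b) >= min_delta for a, b in combinations(masses, 2))


if __name__ == "__main__":
    # Hypothetical detectors: D1 and D2 have the same base composition,
    # so only a mass modifier (assumed here to add 50 Da) separates them.
    d1 = oligo_mass("ACGTACGTACGT")
    d2_plain = oligo_mass("TGCATGCATGCA")
    d2_modified = oligo_mass("TGCATGCATGCA", modifier_mass=50.0)
    print(can_multiplex([d1, d2_plain], 30.0))
    print(can_multiplex([d1, d2_modified], 30.0))
```

In this sketch, detectors of identical composition are separated only by the assumed modifier mass, which mirrors the two options named above: differentiation by sequence or by mass modifying functionalities.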

[0042] FIGURE 4 is a diagram showing a format wherein a predesigned target capture site (TCS) is incorporated
into the target sequence using nucleic acid (i.e., PCR) amplification. Only one strand is captured, the other is removed
(e.g., based on the interaction between biotin and streptavidin coated magnetic beads). If the biotin is attached to
primer 1 the other strand can be appropriately marked by a TCS. Detection is as described above through the interaction
of a specific detector oligonucleotide D with the corresponding target detection site TDS via mass spectrometry.
[0043] FIGURE 5 is a diagram showing how amplification (here ligase chain reaction (LCR)) products can be prepared
and detected by mass spectrometry. Mass differentiation can be achieved by the mass modifying functionalities (M1
and M2) attached to primers (P1 and P4 respectively). Detection by mass spectrometry can be accomplished directly
(i.e., without employing immobilization and target capturing sites (TCS)). Multiple LCR reactions can be performed in
parallel by providing an ordered array of capturing sequences (C). This format allows separation of the ligation products
and spot by spot identification via mass spectrometry or multiplexing if mass differentiation is sufficient.
[0044] FIGURE 6A is a diagram showing mass spectrometric analysis of a nucleic acid molecule, which has been
amplified by a transcription amplification procedure. An RNA sequence is captured via its TCS sequence, so that
wildtype and mutated target detection sites can be detected as above by employing appropriate detector
oligonucleotides (D).

[0045] FIGURE 6B is a diagram showing multiplexing to detect two different (mutated) sites on the same RNA in a 
simultaneous fashion using mass-modified detector oligonucleotides M1-D1 and M2-D2. 
[0046] FIGURE 6C is a diagram of a different multiplexing procedure for detection of specific mutations by employing
mass modified dideoxynucleoside or 3'-deoxynucleoside triphosphates and an RNA dependent DNA polymerase.
Alternatively, DNA dependent RNA polymerase and ribonucleotide phosphates can be employed. This format allows for
simultaneous detection of all four base possibilities at the site of a mutation (X).

[0047] FIGURE 7A is a diagram showing a process for performing mass spectrometric analysis on one target detection
site (TDS) contained within a target nucleic acid molecule (T), which has been obtained from a biological sample.
A specific capture sequence (C) is attached to a solid support (SS) via a spacer (S). The capture sequence is chosen
to specifically hybridize with a complementary sequence on T known as the target capture site (TCS). A nucleic acid
molecule that is complementary to a portion of the TDS is hybridized to the TDS 5' of the site of a mutation (X) within
the TDS. The addition of a complete set of dideoxynucleosides or 3'-deoxynucleoside triphosphates (e.g., pppAdd,
pppTdd, pppCdd and pppGdd) and a DNA dependent DNA or RNA polymerase allows for the addition only of the one
dideoxynucleoside or 3'-deoxynucleoside triphosphate that is complementary to X.
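
The single-base extension logic described above can be sketched as follows. This is an illustrative Python example only, under stated assumptions: it covers just the DNA case with unmodified dideoxynucleotide terminators, it uses standard average residue masses, and the primer mass of 5000.0 Da, the 2 Da tolerance and all function names are hypothetical.

```python
# Standard average masses (Da) added to a primer when one unmodified
# dideoxynucleotide residue is incorporated at its 3' end.
DD_RESIDUE_MASS = {"A": 297.21, "C": 273.18, "G": 313.21, "T": 288.20}

# Watson-Crick pairing: template base -> incorporated terminator base.
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def extend_primer(primer_mass, template_base):
    """Return (terminator base, expected mass of the extended primer) for
    the template base X located directly past the primer 3' end."""
    terminator = COMPLEMENT[template_base.upper()]
    return terminator, primer_mass + DD_RESIDUE_MASS[terminator]


def call_base(primer_mass, observed_mass, tolerance=2.0):
    """Infer the template base X from the observed mass of the extended
    primer; return None if no terminator matches within the tolerance."""
    for template_base in "ACGT":
        _, expected = extend_primer(primer_mass, template_base)
        if abs(observed_mass - expected) <= tolerance:
            return template_base
    return None


if __name__ == "__main__":
    # Hypothetical primer of 5000.0 Da annealed directly 5' of X.
    for x in "ACGT":
        print(x, extend_primer(5000.0, x))
```

Because the four terminator residues differ from each other by at least about 9 Da in this table, a single mass measurement of the extended primer is sufficient, in principle, to identify which terminator was added and therefore the identity of X.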

[0048] FIGURE 7B is a diagram showing a process for performing mass spectrometric analysis to determine the
presence of a mutation at a potential mutation site (M) within a nucleic acid molecule. This format allows for
simultaneous analysis of alleles (A) and (B) of a double stranded target nucleic acid molecule, so that a diagnosis of
homozygous normal, homozygous mutant or heterozygous can be provided. Alleles A and B are each hybridized with
complementary oligonucleotides ((C) and (D) respectively), that hybridize to A and B within a region that includes M.
Each heteroduplex is then contacted with a single strand specific endonuclease, so that a mismatch at M, indicating
the presence of a mutation, results in the cleavage of (C) and/or (D), which can then be detected by mass spectrometry.
[0049] FIGURE 8 is a diagram showing how both strands of a target DNA can be prepared for detection using
transcription vectors having two different promoters at opposite locations (e.g., the SP6 and T7 promoter). This format is
particularly useful for detecting heterozygous target detection sites (TDS). Employing the SP6 or the T7 RNA polymerase,
both strands could be transcribed separately or simultaneously. The transcribed RNA molecules can be specifically
captured and simultaneously detected using appropriately mass-differentiated detector oligonucleotides. This can be
accomplished either directly in solution or by parallel processing of many target sequences on an ordered array of
specifically immobilized capturing sequences.

[0050] FIGURE 9 is a diagram showing how RNA prepared as described in Figures 6, 7 and 8 can be specifically
digested using one or more ribonucleases and the fragments captured on a solid support carrying the corresponding
complementary sequences. Hybridization events and the actual molecular weights of the captured target sequences
provide information on whether and where mutations in the gene are present. The array can be analyzed spot by spot
using mass spectrometry. DNA can be similarly digested using a cocktail of nucleases including restriction
endonucleases. Mutations can be detected by different molecular weights of specific, individual fragments compared to the
molecular weights of the wildtype fragments.

[0051] FIGURE 10A shows UV spectra resulting from the experiment described in the following Example 1. Panel
i) shows the absorbance of the 26-mer before hybridization. Panel ii) shows the filtrate of the centrifugation after
hybridization. Panel iii) shows the results after the first wash with 50 mM ammonium citrate. Panel iv) shows the results
after the second wash with 50 mM ammonium citrate.

[0052] FIGURE 10B shows a mass spectrum resulting from the experiment described in the following Example 1 
after three washing/centrifugation steps. 

[0053] FIGURE 10C shows a mass spectrum resulting from the experiment described in the following Example 1
showing the successful desorption of the hybridized 26-mer off of beads in accordance with the format depicted
schematically in Figure 1B.

[0054] FIGURE 11 shows a mass spectrum resulting from the experiment described in the following Example 1
showing the successful desorption of the hybridized 40-mer, giving proof of an experiment as schematically depicted
in FIGURE 1B. The efficiency of detection suggests that fragments much longer than 40-mers can also be desorbed.
FIGURE 12 shows a mass spectrum resulting from the experiment described in the following Example 2 showing the
successful desorption and differentiation of an 18-mer and 19-mer by electrospray mass spectrometry: the mixture
(top), peaks resulting from 18-mer emphasized (middle) and peaks resulting from 19-mer emphasized (bottom).
[0055] FIGURE 13 is a graphic representation of the process for detecting the Cystic Fibrosis mutation ΔF508 as
described in Example 3.

[0056] FIGURE 14 is a mass spectrum of the DNA extension product of a ΔF508 homozygous normal of Example 3.
[0057] FIGURE 15 is a mass spectrum of the DNA extension product of a ΔF508 heterozygous mutant of Example 3.
[0058] FIGURE 16 is a mass spectrum of the DNA extension product of a ΔF508 homozygous normal of Example 3.
[0059] FIGURE 17 is a mass spectrum of the DNA extension product of a ΔF508 homozygous mutant of Example 3.
[0060] FIGURE 18 is a mass spectrum of the DNA extension product of a ΔF508 heterozygous mutant of Example 3.
[0061] FIGURE 19 is a graphic representation of various processes for performing apolipoprotein E genotyping of 
Example 4. 

[0062] FIGURE 20 shows the nucleic acid sequence of normal apolipoprotein E (encoded by the E3 allele, FIG. 20B)
and other isotypes encoded by the E2 and E4 alleles (FIG. 20A).

[0063] FIGURE 21A shows a composite restriction pattern for various genotypes of apolipoprotein E using the CfoI
restriction endonuclease.

[0064] FIGURE 21B shows the restriction pattern obtained in a 3.5% MetaPhor Agarose Gel for various genotypes
of apolipoprotein E.

[0065] FIGURE 21C shows the restriction pattern obtained in a 12% polyacrylamide gel for various genotypes of
apolipoprotein E.

[0066] FIGURE 22A is a chart showing the molecular weights of the 91, 83, 72, 48 and 35 base pair fragments 
obtained by restriction enzyme cleavage of the E2, E3 and E4 alleles of apolipoprotein E. 

[0067] FIGURE 22B is the mass spectrum of the restriction product of a homozygous E4 apolipoprotein E genotype. 
[0068] FIGURE 23A is the mass spectrum of the restriction product of a homozygous E3 apolipoprotein E genotype. 
[0069] FIGURE 23B is the mass spectrum of the restriction product of a E3/E4 apolipoprotein E genotype.
[0070] FIGURE 24 is an autoradiograph of Example 5 of a 7.5% polyacrylamide gel in which 10% (5 µl) of each
amplified sample was loaded: sample M: pBR322 AluI digested; sample 1: HBV positive in serological analysis; sample
2: also HBV positive; sample 3: without serological analysis but with an increased level of transaminases, indicating
liver disease; sample 4: HBV negative containing HCV; sample 5: HBV positive; (-) negative control; (+) positive control.
Staining was done with ethidium bromide.

[0071] FIGURE 25A is a mass spectrum of sample 1, which is HBV positive. The signal at 20754 Da represents the
HBV related amplification product (67 nucleotides, calculated mass: 20735 Da). The mass signal at 10390 Da
represents the [M+2H]2+ molecule ion (calculated: 10378 Da).

[0072] FIGURE 25B is a mass spectrum of sample 3, which is HBV negative according to nucleic acid (i.e.,
PCR), serological and dot blot based assays. The amplified product is generated only in trace amounts. Nevertheless
it is unambiguously detected at 20751 Da (calculated mass: 20735 Da). The mass signal at 10397 Da represents the
[M+2H]2+ molecule ion (calculated: 10376 Da).

[0073] FIGURE 25C is a mass spectrum of sample 4, which is HBV negative, but HCV positive. No HBV specific 
signals were observed. 

[0074] FIGURE 26 shows a part of the E. coli lacI gene with binding sites of the complementary oligonucleotides
used in the ligase chain reaction (LCR) of Example 6. Here the wildtype sequence is displayed. The mutant contains
a point mutation at bp 191 which is also the site of ligation (bold). The mutation is a C to T transition (G to A, respectively).
This leads to a T-G mismatch with oligo B (and A-C mismatch with oligo C, respectively).

[0075] FIGURE 27 is a 7.15% polyacrylamide gel of Example 6 stained with ethidium bromide. M: chain length
standard (pUC19 DNA, MspI digested). Lane 1: LCR with wildtype template. Lane 2: LCR with mutant template. Lane
3: (control) LCR without template. The ligation product (50 bp) was only generated in the positive reaction containing
wildtype template.
[0076] FIGURE 28 is an HPLC chromatogram of two pooled positive LCRs.

[0077] FIGURE 29 shows an HPLC chromatogram obtained under the same conditions except that mutant template was
used. The small signal of the ligation product is due to either template-free ligation of the educts or to a ligation at a
(G-T, A-C) mismatch. The 'false positive' signal is significantly lower than the signal of ligation product with wildtype
template depicted in Figure 28. The analysis of ligation educts leads to 'double-peaks' because two of the
oligonucleotides are 5'-phosphorylated.

[0078] FIGURE 30: In (b) the complex signal pattern obtained by MALDI-TOF-MS analysis of the Pfu DNA ligase solution
of Example 6 is depicted. In (a) a MALDI-TOF spectrum of an unpurified LCR is shown. The mass signal at 67569 Da
probably represents the Pfu DNA ligase.

[0079] FIGURE 31 shows a MALDI-TOF spectrum of two pooled positive LCRs (a). The signal at 7523 Da represents
unligated oligo A (calculated: 7521 Da) whereas the signal at 15449 Da represents the ligation product (calculated:
15450 Da). The signal at 3774 Da is the [M+2H]2+ signal of oligo A. The signals in the mass range lower than 2000
Da are due to the matrix ions. The spectrum corresponds to lane 1 in Figure 27 and the chromatogram in Figure 28. In
(b) a spectrum of two pooled negative LCRs (mutant template) is shown. The signal at 7517 Da represents oligo A
(calculated: 7521 Da).

[0080] FIGURE 32 shows a spectrum of two pooled control reactions (with salmon sperm DNA as template). The
signals in the mass range around 2000 Da are due to Tween20; only oligo A could be detected, as expected.
[0081] FIGURE 33 shows a spectrum of two pooled positive LCRs (a). The purification was done with a combination
of ultrafiltration and streptavidin DynaBeads as described in the text. The signal at 15448 Da represents the ligation
product (calculated: 15450 Da). The signal at 7527 Da represents oligo A (calculated: 7521 Da). The signal at 3761 Da
is the [M+2H]2+ signal of oligo A, whereas the signal at 5140 Da is the [M+3H]3+ signal of the ligation product. In (b)
a spectrum of two pooled negative LCRs (without template) is shown. The signal at 7514 Da represents oligo A
(calculated: 7521 Da).
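
The assignment of the lower-mass signals to multiply protonated ions can be checked with simple arithmetic. The following Python sketch is illustrative only: it takes the calculated masses quoted above (7521 Da for oligo A and 15450 Da for the ligation product), assumes that each charge is carried by one added proton, and uses a hypothetical function name.

```python
PROTON_MASS = 1.00728  # Da, mass of one proton


def mz(neutral_mass, charge):
    """Expected m/z of an [M + zH]z+ ion for a neutral mass M and charge z."""
    return (neutral_mass + charge * PROTON_MASS) / charge


if __name__ == "__main__":
    # Doubly protonated oligo A and triply protonated ligation product,
    # using the calculated masses quoted in the figure description.
    print(round(mz(7521.0, 2), 1))
    print(round(mz(15450.0, 3), 1))
```

Under these assumptions the doubly protonated oligo A is expected near 3761.5 and the triply protonated ligation product near 5151.0, which can be compared with the observed signals at 3761 Da and 5140 Da.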

[0082] FIGURE 34 is a schematic presentation of the oligo base extension of the mutation detection primer as
described in Example 7, using ddTTP (A) or ddCTP (B) in the reaction mix, respectively. The theoretical mass calculation
is given in parentheses. The sequence shown is part of the exon 10 of the CFTR gene that bears the most common
cystic fibrosis mutation ΔF508 and more rare mutations ΔI507 as well as Ile506Ser.

[0083] FIGURE 35 is a MALDI-TOF-MS spectrum recorded directly from precipitated oligo base extended primers
for mutation detection. The spectra in (A) and (B) show the annealed primer (CF508) without further
extension reaction. Panel C displays the MALDI-TOF spectrum of the wild type using pppTdd in the extension
reaction, and panel D a heterozygotic extension product carrying the 506S mutation when using pppCdd as terminator.
Panels E and F show a heterozygote with ΔF508 mutation with pppTdd and pppCdd as terminators in the extension
reaction. Panels G and H represent a homozygous ΔF508 mutation with either pppTdd or pppCdd as terminators. The
template of diagnosis is pointed out below each spectrum and the observed/expected molecular masses are written in
parentheses.

[0084] FIGURE 36 shows the portion of the sequence of pRFcl DNA, which was used as template for nucleic acid
amplification in Example 8 of unmodified and 7-deazapurine containing 99-mer and 200-mer nucleic acids as well as
the sequences of the 19-mer forward primer and the two 18-mer reverse primers.

[0085] FIGURE 37 shows the portion of the nucleotide sequence of M13mp18 RFI DNA, which was used in Example
8 for nucleic acid amplification of unmodified and 7-deazapurine containing 103-mer nucleic acids. Also shown are
nucleotide sequences of the 17-mer primers used in the nucleic acid amplification reaction.

[0086] FIGURE 38 shows the result of a polyacrylamide gel electrophoresis of amplified products described in
Example 8 purified and concentrated for MALDI-TOF MS analysis. M: chain length marker, lane 1: 7-deazapurine
containing 99-mer amplified product, lane 2: unmodified 99-mer, lane 3: 7-deazapurine containing 103-mer and lane 4:
unmodified 103-mer amplified product.

[0087] FIGURE 39: an autoradiogram of polyacrylamide gel electrophoresis of nucleic acid (i.e., PCR) reactions
carried out with 5'-[32P]-labeled primers 1 and 4. Lanes 1 and 2: unmodified and 7-deazapurine modified 103-mer
amplified product (53321 and 23520 counts), lanes 3 and 4: unmodified and 7-deazapurine modified 200-mer (71123
and 39582 counts) and lanes 5 and 6: unmodified and 7-deazapurine modified 99-mer (173216 and 94400 counts).

[0088] FIGURE 40: a) MALDI-TOF mass spectrum of the unmodified 103-mer amplified products (sum of twelve
single shot spectra). The mean value of the masses calculated for the two single strands (31768 u and 31759 u) is
31763 u. Mass resolution: 18. b) MALDI-TOF mass spectrum of 7-deazapurine containing 103-mer amplified product
(sum of three single shot spectra). The mean value of the masses calculated for the two single strands (31727 u and
31719 u) is 31723 u. Mass resolution: 67.

[0089] FIGURE 41: a) MALDI-TOF mass spectrum of the unmodified 99-mer amplified product (sum of twenty single
shot spectra). Values of the masses calculated for the two single strands: 30261 u and 30794 u. b) MALDI-TOF mass
spectrum of 7-deazapurine containing 99-mer amplified product (sum of twelve single shot spectra). Values of the
masses calculated for the two single strands: 30224 u and 30750 u.
[0090] FIGURE 42: a) MALDI-TOF mass spectrum of the unmodified 200-mer amplified product (sum of 30 single
shot spectra). The mean value of the masses calculated for the two single strands (61873 u and 61595 u) is 61734 u.
Mass resolution: 28. b) MALDI-TOF mass spectrum of 7-deazapurine containing 200-mer amplified product (sum of
30 single shot spectra). The mean value of the masses calculated for the two single strands (61772 u and 61714 u) is
61643 u. Mass resolution: 39.

[0091] FIGURE 43: a) MALDI-TOF mass spectrum of 7-deazapurine containing 100-mer amplified product with
ribomodified primers. The mean value of the masses calculated for the two single strands (30529 u and 31095 u) is
30812 u. b) MALDI-TOF mass spectrum of the amplified product after hydrolytic primer-cleavage. The mean value of
the masses calculated for the two single strands (25104 u and 25229 u) is 25167 u. The mean value of the cleaved
primers (5437 u and 5918 u) is 5677 u.

[0092] FIGURE 44 A-D shows the MALDI-TOF mass spectra of the four sequencing ladders obtained from a 39-mer
template (SEQ ID No. 23), which was immobilized to streptavidin beads via a 3' biotinylation. A 14-mer primer (SEQ 
ID NO. 24) was used in the sequencing according to Example 9. 

[0093] FIGURE 45 shows a MALDI-TOF mass spectrum of a solid phase sequencing of a 78-mer template (SEQ ID
No. 25), which was immobilized to streptavidin beads via a 3' biotinylation. An 18-mer primer (SEQ ID No. 26) and
ddGTP were used in the sequencing.

[0094] FIGURE 46 shows a scheme in which duplex DNA probes with single-stranded overhang capture specific 
DNA templates and also serve as primers for solid phase sequencing. 

[0095] FIGURE 47 A-D shows MALDI-TOF mass spectra obtained from a sequencing reaction using 5' fluorescent
labeled 23-mer (SEQ ID No. 29) annealed to a 3' biotinylated 18-mer (SEQ ID No. 30), leaving a 5-base overhang,
which captured a 15-mer template (SEQ ID No. 31) as described in Example 9.

[0096] FIGURE 48 shows a stacking fluorogram of the same products obtained from the reaction described in
FIGURE 47, but run on a conventional DNA sequencer.

[0097] FIGURE 49 shows a MALDI-TOF mass spectrum of the sequencing ladder using cycle sequencing as
described in Example 1 generated from a biological amplified product as template and a 12-mer (5'-TGC AGO TGA GTC-3'
(SEQ ID NO. 34)) sequencing primer. The peaks resulting from depurinations and peaks which are not related to
the sequence are marked by an asterisk. MALDI-TOF MS measurements were taken on a reflectron TOF MS. A.)
Sequencing ladder stopped with ddATP; B.) Sequencing ladder stopped with ddCTP; C.) Sequencing ladder stopped
with ddGTP; D.) Sequencing ladder stopped with ddTTP.
[0098] FIGURE 50 shows a schematic representation of the sequencing ladder generated in Fig. 49 with the
corresponding calculated molecular masses up to 40 bases after the primer. For the calculation, the following masses were
used: 3581.4 Da for the primer, 312.2 Da for 7-deaza-dATP, 304.2 Da for dTTP, 289.2 Da for dCTP and 328.2 Da for
7-deaza-dGTP.
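
The calculation referred to above can be illustrated by cumulatively adding the listed per-nucleotide masses to the primer mass. The following Python sketch is a simplified example only: the masses are those quoted in the preceding paragraph, whereas the function names, the example extension "ACGT" and the 2 Da tolerance are hypothetical, and no correction for the terminating dideoxynucleotide is applied because the text does not state how it was handled.

```python
from itertools import accumulate

# Masses (Da) quoted in the figure description.
PRIMER_MASS = 3581.4
INCREMENT = {
    "A": 312.2,  # listed for 7-deaza-dATP
    "T": 304.2,  # listed for dTTP
    "C": 289.2,  # listed for dCTP
    "G": 328.2,  # listed for 7-deaza-dGTP
}


def ladder_masses(extension):
    """Cumulative masses of the primer extended by 1, 2, ... bases of the
    given extension sequence (read 5' to 3' after the primer)."""
    steps = [INCREMENT[base] for base in extension.upper()]
    totals = accumulate(steps, initial=PRIMER_MASS)
    return [round(total, 1) for total in totals][1:]


def base_from_step(delta, tolerance=2.0):
    """Assign a base to the mass difference between two adjacent ladder
    peaks; return None if no listed increment lies within the tolerance."""
    for base, increment in INCREMENT.items():
        if abs(delta - increment) <= tolerance:
            return base
    return None


if __name__ == "__main__":
    # Hypothetical four-base extension, for illustration only.
    print(ladder_masses("ACGT"))
```

Reading a sequence from such a ladder amounts to the inverse operation: each mass difference between adjacent peaks is matched against the four increments, the closest two of which (304.2 Da and 312.2 Da) differ by 8 Da.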

[0099] FIGURE 51 shows the sequence of the 209 bp amplified product within the β-globin gene, which
was used as a template for sequencing. The sequences of the appropriate amplification primers and the location of the
12-mer sequencing primer are also shown. This sequence represents a homozygote mutant at the position 4 bases after
the primer. In a wildtype sequence this T would be replaced by an A.

[0100] FIGURE 52 shows a sequence which is part of the intron 5 of the interferon-α receptor gene that bears the
AluVpA polymorphism as further described in Example 11. The scheme presents the primer oligo base extension
(PROBE) using ddGTP, ddCTP, or both for termination, respectively. The polymorphism detection primer (IFN) is
underlined, the termination nucleotides are marked in bold letters. The theoretical mass values from the alleles found in
28 unrelated individuals and a five member family are given in the table. Both second site mutations found in most,
but not all, 13 unit alleles are indicated.

[0101] FIGURE 53 shows the MALDI-TOF-MS spectra recorded directly from precipitated extended cyclePROBE
reaction products. Family study using AluVpA polymorphism in intron 5 of the interferon-α receptor gene (Example 11).
[0102] FIGURE 54 shows the mass spectra from PROBE products using ddC as termination nucleotide in the reaction
mix. The allele with the molecular mass of approximately 11650 Da from the DNA of the mother and child 2 is a hint to
a second site mutation within one of the repeat units.

[0103] FIGURE 55 shows a schematic presentation of the PROBE method for detection of different alleles in the
polyT tract at the 3'-end of intron 8 of the CFTR gene with pppCdd as terminator (Example 11).

[0104] FIGURE 56 shows the MALDI-TOF-MS spectra recorded directly from the precipitated extended PROBE
reaction products. Detection of all three common alleles of the polyT tract at the 3' end of intron 8 of the CFTR gene:
(a) T5/T9 heterozygous, (b) T7/T9 heterozygous (Example 11).

[0105] FIGURE 57 shows a mass spectrum of the digestion of a 252-mer ApoE gene amplified product (ε3/ε3
genotype) as described in Example 12 using a) CfoI alone and b) CfoI plus RsaI. Asterisks: depurination peaks.

[0106] FIGURE 58 shows a mass spectrum of the ApoE gene amplified product (ε3/ε3 genotype) digested by CfoI
and purified by a) single and b) double ethanol/glycogen and c) double isopropyl alcohol/glycogen precipitations.
[0107] FIGURE 59 shows a mass spectrum of the CfoI/RsaI digest products from a) ε2/ε3, b) ε3/ε3, c) ε3/ε4,
and d) ε4/ε4 genotypes. Dashed lines are drawn through diagnostic fragments.

[0108] FIGURE 60 shows a scheme for rapid identification of unknown ApoE genotypes following simultaneous
digestion of a 252-mer ApoE gene amplified product by the restriction enzymes CfoI and RsaI.
[0109] FIGURE 61 shows the multiplex (codons 112 and 158) mass spectrum PROBE results for a) ε2/ε3, b) ε3/ε3,
c) ε3/ε4, and d) ε4/ε4 genotypes. E: extension products; P: unextended primer. Top: codon 112 and 158 regions,
with polymorphic sites bold and primer sequences underlined.

[0110] FIGURE 62 shows a mass spectrum of a TRAP assay to detect telomerase activity (Example 13). The
spectrum shows two of the primer signals of the amplified product, the TS primer at 5,497.3 Da (calc. 5523 Da) and the
biotinylated bioCX primer at 7,537.6 Da (calc. 7,537 Da), and the first telomerase-specific assay product containing three
telomeric repeats at 12,775.8 Da (calc. 12,452 Da); its mass is larger by one dA nucleotide (12,765 Da) due to extendase
activity of Taq DNA polymerase.

[0111] FIGURE 63 depicts the higher mass range of FIGURE 62, i.e., the peak at 12,775.6 Da represents the product
with three telomeric repeats. The peak at 20,322.1 Da is the result of a telomerase activity to form seven telomeric
repeats (calc. 20,395 Da including the extension by one dA nucleotide). The peaks marked 1, 2, 3 and 4 contain a four
telomeric repeat product at 14,674 Da as well as secondary ion products.

[0112] FIGURE 64 displays a MALDI-TOF spectrum of the RT-amplified product of the human tyrosine hydroxylase
mRNA indicating the presence of neuroblastoma cells (Example 14). The signal at 18,763.8 Da represents the
non-biotinylated single-stranded 61-mer of the nested amplified product (calc. 18,758.2 Da).

[0113] FIGURE 65(a) shows a schematic representation of a PROBE reaction for the RET proto-oncogene with a
mixture of dATP, dCTP, dGTP, and ddTTP (Example 15). B represents biotin, through which the sense template strand
is bound through streptavidin to a solid support. Figure 65(b) shows the expected PROBE products for ddT and ddA
reactions for wildtype, C→T, and C→A antisense strands.

[0114] FIGURE 66 shows the PROBE product mass spectra for (a) negative control, (b) Patient 1 being heterozygote
(Wt/G→T) and (c) Patient 2 being heterozygote (Wt/C→A), reporting average values.
[0115] FIGURE 67 shows the MALDI-FTMS spectra for synthetic analogs representing ribo-cleaved RET
proto-oncogene amplified products from (a) wildtype, (b) G→A, and (c) G→T homozygotes, and (d) wildtype/G→A, (e)
wildtype/G→T, and (f) G→A/G→T heterozygotes, reporting masses of most abundant isotope peaks.

[0116] FIGURE 68 is a schematic representation of nucleic acid immobilization via covalent bifunctional trityl linkers.

[0117] FIGURE 69 is a schematic representation of nucleic acid immobilization via hydrophobic trityl linkers. 
[0118] FIGURE 70 shows a MALDI-TOF mass spectrum of a supernatant of the matrix treated Dynabeads containing
bound oligo (5'-iminobiotin - TGCAGGTGAGTG, SEQ ID NO. 56). An internal standard (GTGTGGTCGTGC, SEQ ID
NO. 57) was included in the matrix.

[0119] FIGURE 71 shows a MALDI-TOF mass spectrum of a supernatant of the matrix treated Dynabeads containing
bound oligo (5'-iminobiotin - TGCAGGTGAGTG, SEQ ID NO. 56). An internal standard (GTGTGGTCGTGC, SEQ ID
NO. 57) was included in the matrix.

[0120] FIGURE 72 schematically depicts the steps involved with the Loop-primer oligo base extension (Loop-probe) 
reaction. 

[0121] FIGURE 73A shows a MALDI-TOF mass spectrum of a supernatant after CfoI digest of a stem loop. Figures
73B-D show MALDI-TOF mass spectra of different genotypes: HbA, the wildtype genotype (74B), HbC, a mutation
of codon 6 of the β-globin gene which causes sickle cell disease (74C), and HbS, a different mutation of codon 6 of
the β-globin gene which causes sickle cell disease (74D).

[0122] FIGURE 74 shows the nucleic acid sequence of the amplified region of CKR-5. The underlined sequence 
corresponds to the region homologous to the amplification primers. The dotted region corresponds to the 32 bp deletion. 
[0123] FIGURE 75 shows the sense primer ckrT7f. Being designed to facilitate binding of T7 RNA polymerase and
amplification of the CKR-5 region to be analyzed, it starts with a randomly chosen sequence of 24 bases, followed by the
T7 promoter sequence of 18 bases and the sequence homologous to CKR-5 of 19 bases.

[0124] FIGURE 76 is a MALDI-TOF mass spectrum of the CKR-5 amplification product, which was generated as
described in the following Example 21.

[0125] FIGURE 77 shows positive ion UV-MALDI mass spectra of a synthetic RNA 25-mer
(5'-UCCGGUCUGAUGAGUCCGUGAGGAC-3' SEQ ID NO. 62) digested with selected RNAses. For each enzyme 0.6 µl aliquots of the
4.5 µl assay containing a total of ca. 20 pmol of the RNA were mixed with 1.5 µl matrix (3-HPA) for analysis. Fragments
with retained 5'-terminus are marked by different arrows, specific for the different RNAses (Hahner et al., Proceedings of
the 44th ASMS Conference on Mass Spectrometry and Allied Topics, p. 983 (1996)).

[0126] FIGURE 78 is an investigation of the specificity of the RNAses CL3 and Cusativin by positive ion UV-MALDI
mass spectra of a synthetic RNA 20-mer. Expected and/or observed cleavage sites are indicated by arrows. A, B, C
indicate correct cleavage sites and corresponding singly cleaved fragments. Missing cleavages are designated by a
question mark (?), unspecific cleavages by an X.

[0127] FIGURE 79 shows the separation of a mixture of DNA molecules (12-mer, 5'-biot. 19-mer, 22-mer and 5'-biot.
27-mer) with streptavidin-coated magnetic beads. a) Positive ion UV-MALDI mass spectrum of 0.6 µl of a mixture
containing ca. 2-4 pmol of each species mixed with 1.5 µl matrix (3-HPA). b) Same as a) but after incubation of the mixture
with magnetic beads and subsequent release of the captured fragments.

[0128] FIGURE 80: Elution of immobilized 5' biotinylated 49 nt in vitro transcript from the streptavidin-coated magnetic
beads. Positive UV-MALDI mass spectrum of the transcript prior to incubation with the magnetic beads (a). Spectra of
the immobilized RNA transcript after elution with 95% formamide alone (b) and with various additives such as 10 mM
EDTA (c), 10 mM CDTA (d) and 25% ammonium hydroxide (e); EDTA and CDTA were adjusted with 25% ammonium
hydroxide to a pH of 8.

[0129] FIGURE 81 shows positive UV-MALDI mass spectra of the 5'-biotinylated 49 nt in vitro transcript after RNAse U2
digest for 15 minutes: a) spectrum of the 25 µl assay containing ca. 100 pmol of the target RNA before separation; b)
spectrum after isolation of the 5'-biotinylated fragments with magnetic beads. Captured fragments were released by a
solution of 95% formamide containing 10 mM CDTA. 1 µl aliquots of the samples were mixed with 1.5 µl matrix (3-HPA)
in both cases.

[0130] FIGURE 82 schematically depicts detection of putative mutations in the human β-globin gene at codons 5 and
6 and at codon 30 and the IVS-1 donor site, respectively, done in parallel. FIGURE 82A shows amplification of genomic
DNA using the primers β2 and β11. The location of the primers and identification tags as well as an indication of the
wild type and mutant sequences are shown. FIGURE 82B shows analysis of both sites in a simple Primer Reaction
Oligo Base Extension (PROBE) using primers β-TAG1 (which binds upstream of codons 5 and 6) and β-TAG2 (which
binds upstream of codon 30 and the IVS-1 donor site). Reaction products are captured using streptavidin-coated
paramagnetic particle-bound biotinylated capture primers (cap-tag-1 and cap-tag-2, respectively) that have 6 bases at
the 5' end that are complementary to the 5' end of β-TAG1 and β-TAG2, respectively, and a portion which binds to a
universal primer.

[0131] FIGURE 83 shows a mass spectrum of the PROBE products of a DNA sample from one individual analyzed 
as described schematically in FIGURE 82. 

[0132] FIGURE 84 shows a mass spectrum of the sequence bound to cap-tag-2.

[0133] FIGURE 85 shows a mass spectrum obtained by using the β-TAG1 and β-TAG2 primers in one sequencing
reaction using ddATP for termination and then sorting according to the method depicted in FIGURE 82.

[0134] FIGURE 86 shows a mass spectrum obtained by using the β-TAG1 and β-TAG2 primers in one sequencing
reaction using ddCTP for termination and then sorting according to the method depicted in FIGURE 82.

[0135] FIGURE 87A shows the wildtype sequence of a fragment of the chemokine receptor CKR-5 gene with primers
(bold) used for amplification. The 32 base pair (bp) deletion in the CKR-5 allele is underlined, and the stop nucleotides
are in italic. In FIGURE 87B, the wildtype strands are depicted with and without an added adenosine; their lengths and
molecular masses are indicated. FIGURE 87C indicates the same for the 32 bp deletion. FIGURE 87D shows the
PROBE products for the wildtype gene and FIGURE 87E shows the mutated allele.

[0136] FIGURE 88 shows the amplification products of different unrelated individuals as analyzed by native
polyacrylamide gel electrophoresis (15%) and silver stain. The band corresponding to a wildtype CKR-5 runs at 75 bp and
the band from the gene with the deletion at 43 bp. Bands bigger than 75 bp are due to unspecific amplification.

[0137] FIGURE 89A shows a spectrograph of DNA derived from a heterozygous individual: the peak with a mass of
23319 Da corresponds to the wildtype CKR-5 and the peaks with masses of 13137 Da and 13451 Da to the deletion
allele with and without an extra adenosine, respectively. FIGURE 89B shows a spectrograph of DNA obtained from
the same individual as in FIGURE 89A, but the DNA was treated with T4 DNA polymerase to remove the added
adenosine. FIGURES 89C and 89D are spectrographs derived from homozygous individuals; in FIGURE 89D, the
adenosine has been removed. All peaks with masses lower than 13000 Da are due to multiply charged molecules.

[0138] FIGURE 90A shows the mass spectrum of the results of a PROBE reaction performed on DNA obtained from
a heterozygous individual. FIGURE 90B shows a mass spectrum of the results of a PROBE reaction on a homozygous
individual. The peaks with masses of 6604 Da and 6607 Da, respectively, correspond to the wildtype allele, and the
peak with a mass of 6275 Da to the deletion allele. The primer is detected with a mass of 5673 and 5676 Da, respectively.

[0139] FIGURE 91 shows a MALDI-TOF MS spectrum of a thermocycling primer oligo base extension (tc-PROBE)
reaction as described in Example 24 using three different templates and 5 different PROBE primers simultaneously in
one reaction.

[0140] FIGURE 92 schematically depicts a single tube process for amplifying and sequencing exons 5-8 of the p53 
gene as described in Example 25. The mass spectrum is the A reaction of Figure 93. 

[0141] FIGURE 93 shows a superposition plot of four separate reactions for sequencing a portion of exon 7 of the 
p53 gene as described in Example 25. 

[0142] FIGURE 94 shows the mass spectrum obtained from the A reaction for sequencing a portion of exon 7 of the
p53 gene as described in Example 25.

[0143] FIGURE 95 shows the mass spectrum of a p53 sequencing ladder for which 5 nl of each reaction were
transferred to wells of a chip and measured by MALDI-TOF.

[0144] FIGURE 96A shows a MALDI-TOF mass spectrum of a synthetic 50-mer (15.34 kDa) mixed with a 27-mer
(non-complementary, 8.30 kDa).

[0145] FIGURE 96B shows a MALDI-TOF mass spectrum of a synthetic 50-mer (15.34 kDa) mixed with a 27-mer
(complementary, 8.34 kDa). The final concentration of each oligonucleotide was 10 µM. The signal at 23.68 kDa in
Figure 96B corresponds to WC-specific dsDNA.

[0146] FIGURE 97A shows a MALDI-TOF mass spectrum of CfoI/RsaI digest products of a region of exon 4 of the
apolipoprotein E gene (ε3 genotype), using sample preparation as in Figure 96.

[0147] FIGURE 97B is the same as Figure 97A, except with samples prepared for MALDI-TOF analysis at 4°C.

[0148] FIGURE 98 shows a MALDI-TOF mass spectrum of CfoI/RsaI simultaneous double digest products of a
252 base pair region of exon 4 of the apolipoprotein E gene (ε4 genotype), with samples prepared at 4°C.

[0149] FIGURE 99 shows the mass spectra obtained in a small population study of 15 patients with a 16 element
array of diagnostic products transferred to a MALDI target using a pintool microdispenser.

[0150] FIGURE 100 is a MALDI mass spectrum of an aliquot sampled after a T1 digest of a synthetic 20-mer RNA.

DETAILED DESCRIPTION OF THE INVENTION

Definitions

[0151] Unless defined otherwise, all technical and scientific terms used herein have the same meaning as is
commonly understood by one of skill in the art to which this invention belongs. Where permitted, the subject matter of each
of the co-pending patent applications and the patent is herein incorporated in its entirety.

[0152] As used herein, the term "biological sample" refers to any material obtained from any living source (e.g.,
human, animal, plant, bacteria, fungi, protist, virus). For purposes herein, the biological sample will typically contain a
nucleic acid molecule. Examples of appropriate biological samples include, but are not limited to: solid materials (e.g.,
tissue, cell pellets, biopsies) and biological fluids (e.g., urine, blood, saliva, amniotic fluid, mouth wash, cerebral spinal
fluid and other body fluids).

[0153] As used herein, the phrases "chain-elongating nucleotides" and "chain-terminating nucleotides" are used in
accordance with their art recognized meaning. For example, for DNA, chain-elongating nucleotides include
2'-deoxyribonucleotides (e.g., dATP, dCTP, dGTP and dTTP) and chain-terminating nucleotides include
2',3'-dideoxyribonucleotides (e.g., ddATP, ddCTP, ddGTP, ddTTP). For RNA, chain-elongating nucleotides include ribonucleotides (e.g.,
ATP, CTP, GTP and UTP) and chain-terminating nucleotides include 3'-deoxyribonucleotides (e.g., 3'dA, 3'dC, 3'dG
and 3'dU). A complete set of chain-elongating nucleotides refers to dATP, dCTP, dGTP and dTTP. The term "nucleotide"
is also well known in the art.

[0154] As used herein, nucleotides include nucleoside mono-, di-, and triphosphates. Nucleotides also include
modified nucleotides such as phosphorothioate nucleotides and deazapurine nucleotides. A complete set of
chain-elongating nucleotides refers to four different nucleotides that can hybridize to each of the four different bases comprising
the DNA template.

[0155] As used herein, the superscript 0-i designates i + 1 mass differentiated nucleotides, primers or tags. In some
instances, the superscript 0 can designate an unmodified species of a particular reactant, and the superscript i can
designate the i-th mass-modified species of that reactant. If, for example, more than one species of nucleic acids are
to be concurrently detected, then i + 1 different mass-modified detector oligonucleotides (D^0, D^1, ... D^i) can be used
to distinguish each species of mass modified detector oligonucleotides (D) from the others by mass spectrometry.

[0156] As used herein, "multiplexing" refers to the simultaneous detection of more than one analyte, such as more
than one (mutated) locus on a particular captured nucleic acid fragment (on one spot of an array).

[0157] As used herein, the term "nucleic acid" refers to single-stranded and/or double-stranded polynucleotides such
as deoxyribonucleic acid (DNA) and ribonucleic acid (RNA), as well as analogs or derivatives of either RNA or DNA.
Also included in the term "nucleic acid" are analogs of nucleic acids such as peptide nucleic acid (PNA),
phosphorothioate DNA, and other such analogs and derivatives.

[0158] As used herein, the term "conjugated" refers to stable attachment, preferably ionic or covalent attachment.
Among preferred conjugation means are: streptavidin- or avidin- to biotin interaction; hydrophobic interaction; magnetic
interaction (e.g., using functionalized magnetic beads, such as DYNABEADS, which are streptavidin-coated magnetic
beads sold by Dynal, Inc., Great Neck, NY and Oslo, Norway); polar interactions, such as "wetting" associations between
two polar surfaces or between oligo/polyethylene glycol; formation of a covalent bond, such as an amide bond, disulfide
bond, thioether bond, or via crosslinking agents; and via an acid-labile or photocleavable linker.

[0159] As used herein, "equivalent," when referring to two sequences of nucleic acids, means that the two sequences
in question encode the same sequence of amino acids or equivalent proteins. When "equivalent" is used in referring
to two proteins or peptides, it means that the two proteins or peptides have substantially the same amino acid sequence
with only conservative amino acid substitutions that do not substantially alter the activity or function of the protein or
peptide. When "equivalent" refers to a property, the property does not need to be present to the same extent (e.g., two
peptides can exhibit different rates of the same type of enzymatic activity), but the activities are preferably substantially
the same. "Complementary," when referring to two nucleotide sequences, means that the two sequences of nucleotides
are capable of hybridizing, preferably with less than 25%, more preferably with less than 15%, even more preferably
with less than 5%, most preferably with no mismatches between opposed nucleotides. Preferably the two molecules
will hybridize under conditions of high stringency.
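
The mismatch thresholds in the definition of "complementary" can be illustrated with a minimal Python sketch. It assumes two equal-length DNA sequences written 5'->3' and counts opposed positions that are not Watson-Crick pairs; the function names and tier labels are hypothetical and chosen here for illustration only, and the sketch says nothing about whether two sequences actually hybridize under a given stringency.

```python
# Illustrative sketch only: names and tier labels are assumptions, not from the text.
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def mismatch_percent(strand: str, other: str) -> float:
    """Percent of opposed positions that are not Watson-Crick pairs.

    Both sequences are given 5'->3'; `other` is reversed so that opposed
    nucleotides line up in antiparallel orientation.
    """
    if not strand or len(strand) != len(other):
        raise ValueError("this sketch compares non-empty, equal-length sequences")
    opposed = other.upper()[::-1]
    mismatches = sum(
        COMPLEMENT.get(base) != partner
        for base, partner in zip(strand.upper(), opposed)
    )
    return 100.0 * mismatches / len(strand)


def preference_tier(percent: float) -> str:
    """Map a mismatch percentage onto the tiers named in the definition."""
    if percent == 0:
        return "most preferred (no mismatches)"
    if percent < 5:
        return "even more preferred (<5%)"
    if percent < 15:
        return "more preferred (<15%)"
    if percent < 25:
        return "preferred (<25%)"
    return "outside the preferred ranges"
```

For example, `mismatch_percent("ATGC", "GCAT")` compares a sequence with its exact reverse complement, so no opposed position is mismatched.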

[0160] As used herein, stringency of hybridization in determining percentage mismatch refers to those conditions
understood by those of skill in the art, which typically are substantially equivalent to the following:

1) high stringency: 0.1 x SSPE, 0.1% SDS, 65°C

2) medium stringency: 0.2 x SSPE, 0.1% SDS, 50°C

3) low stringency: 1.0 x SSPE, 0.1% SDS, 50°C

It is understood that equivalent stringencies may be achieved using alternative buffers, salts and temperatures.

[0161] As used herein, a primer, when set forth in the claims, refers to a primer suitable for mass spectrometric methods
requiring immobilizing, hybridizing, strand displacement or sequencing by mass spectrometry; that is, a nucleic acid that must
be of low enough mass, typically about 70 nucleotides or less, and of sufficient size to be useful in the
methods described herein that rely on mass spectrometric detection. These methods include primers
for detection and sequencing of nucleic acids, which require a sufficient number of nucleotides to form a stable duplex,
typically about 6-30, preferably about 10-25, more preferably about 12-20. Thus, for purposes herein, a primer will be
a sequence of nucleotides comprising about 6-70, more preferably about 12-70, more preferably greater than about 14 to
an upper limit of 70, depending upon sequence and application of the primer. The primers herein, for example for
mutational analyses, are selected to be upstream of loci useful for diagnosis such that, when sequencing
up to or through the site of interest, the resulting fragment is of a mass that is sufficient and not too large to be detected
by mass spectrometry. For mass spectrometric methods, mass tags or modifiers are preferably included at the 5'-end,
and the primer is otherwise unlabeled.
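
To make the link between primer length and detectable mass concrete, the following Python sketch estimates the average molecular mass of an unmodified DNA oligonucleotide and checks its length against the roughly 6-70 nucleotide window given above. The residue masses and the terminal correction are standard average values for an oligonucleotide with 5'-OH and 3'-OH ends and are an assumption of this sketch, not figures taken from this description; the function names are hypothetical.

```python
# Average masses (Da) of 2'-deoxynucleoside monophosphate residues in a DNA chain.
# These are commonly used textbook values, assumed here for illustration.
RESIDUE_MASS = {"A": 313.21, "C": 289.18, "G": 329.21, "T": 304.20}

# Correction from the sum of residues to an oligonucleotide with free
# 5'-OH and 3'-OH termini (no terminal phosphate).
TERMINAL_CORRECTION = -61.96


def oligo_mass(seq: str) -> float:
    """Approximate average mass (Da) of an unmodified single-stranded DNA oligo."""
    seq = seq.upper()
    if not seq or any(base not in RESIDUE_MASS for base in seq):
        raise ValueError("sequence must be non-empty and contain only A, C, G, T")
    return sum(RESIDUE_MASS[base] for base in seq) + TERMINAL_CORRECTION


def primer_length_ok(seq: str, min_nt: int = 6, max_nt: int = 70) -> bool:
    """True if the length falls in the approximate 6-70 nt window described above."""
    return min_nt <= len(seq) <= max_nt
```

Under these assumptions a 70-mer comes out at roughly 21-22 kDa, which gives a feel for the upper mass range a primer of this definition occupies.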

[0162] As used herein, "conditioning" of a nucleic acid refers to modification of the phosphodiester backbone of the
nucleic acid molecule (e.g., cation exchange) for the purpose of eliminating peak broadening due to a heterogeneity
in the cations bound per nucleotide unit. By contacting a nucleic acid molecule with an alkylating agent such as alkyl iodide,
iodoacetamide, β-iodoethanol, or 2,3-epoxy-1-propanol, the monothio phosphodiester bonds of a nucleic acid molecule
can be transformed into phosphotriester bonds. Likewise, phosphodiester bonds may be transformed to uncharged
derivatives employing trialkylsilyl chlorides. Further conditioning involves incorporating nucleotides that reduce
sensitivity to depurination (fragmentation during MS), e.g., a purine analog such as N7- or N9-deazapurine nucleotides, or
RNA building blocks, or using oligonucleotide triesters, or incorporating phosphorothioate functions that are alkylated,
or employing oligonucleotide mimetics such as peptide nucleic acid (PNA).

[0163] As used herein, substrate refers to an insoluble support onto which a sample is deposited according to the
materials described herein. Examples of appropriate substrates include beads (e.g., silica gel, controlled pore glass,
magnetic, agarose gel and crosslinked dextrans (i.e., Sepharose and Sephadex), cellulose) and other materials
known by those of skill in the art to serve as solid support matrices. For example, substrates may be formed from any
or combinations of: silica gel, glass, magnet, polystyrene/1% divinylbenzene resins, such as Wang resins, which are
Fmoc-amino acid-4-(hydroxymethyl)phenoxymethylcopoly(styrene-1% divinylbenzene (DVB)) resin, chlorotrityl
(2-chlorotritylchloride copolystyrene-DVB resin) resin, Merrifield (chloromethylated copolystyrene-DVB) resin, metal,
plastic, cellulose, cross-linked dextrans, such as those sold under the tradename Sephadex (Pharmacia), and agarose
gel, such as gels sold under the tradename Sepharose (Pharmacia), which is a hydrogen bonded polysaccharide-type
agarose gel, and other such resins and solid phase supports known to those of skill in the art. The support matrices
may be in any shape or form, including, but not limited to: capillaries, flat supports such as glass fiber filters, glass
surfaces, metal surfaces (steel, gold, silver, aluminum, copper and silicon), plastic materials including multiwell plates
or membranes (e.g., of polyethylene, polypropylene, polyamide, polyvinylidenedifluoride), pins (e.g., arrays of pins
suitable for combinatorial synthesis or analysis) or beads in pits of flat surfaces such as wafers (e.g., silicon wafers)
with or without plates, and beads.

[0164] As used herein, a selectively cleavable linker is a linker that is cleaved under selected conditions, such as a
photocleavable linker, a chemically cleavable linker and an enzymatically cleavable linker (i.e., a restriction
endonuclease site or a ribonucleotide/RNase digestion). The linker is interposed between the support and immobilized DNA.

Isolation of nucleic acid molecules

[0165] Nucleic acid molecules can be isolated from a particular biological sample using any of a number of
procedures, which are well-known in the art, the particular isolation procedure chosen being appropriate for the particular
biological sample. For example, freeze-thaw and alkaline lysis procedures can be useful for obtaining nucleic acid
molecules from solid materials; heat and alkaline lysis procedures can be useful for obtaining nucleic acid molecules
from urine; and proteinase K extraction can be used to obtain nucleic acid from blood (see, e.g., Rolff et al. (1994)
PCR: Clinical Diagnostics and Research, Springer).

[0166] To obtain an appropriate quantity of nucleic acid molecules on which to perform mass spectrometry,
amplification may be necessary. Examples of appropriate amplification procedures for use herein include: cloning (Sambrook
et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, 1989), polymerase chain reaction
(PCR) (C.R. Newton and A. Graham, PCR, BIOS Publishers, 1994), ligase chain reaction (LCR) (see, e.g., Weidmann
et al. (1994) PCR Methods Appl. Vol. 3, pp. 57-64; F. Barany (1991) Proc. Natl. Acad. Sci. U.S.A. 88:189-93), strand
displacement amplification (SDA) (see, e.g., Walker et al. (1994) Nucleic Acids Res. 22:2670-77) and variations such
as RT-PCR (see, e.g., Higuchi et al. (1993) Bio/Technology 11:1026-1030), allele-specific amplification (ASA) and
transcription based processes.

Immobilization of nucleic acid molecules to solid supports 

[0167] To facilitate mass spectrometric analysis, a nucleic acid molecule containing a nucleic acid sequence to be
detected can be immobilized to an insoluble (i.e., a solid) support. Examples of appropriate solid supports include
beads (e.g., silica gel, controlled pore glass, magnetic, Sephadex/Sepharose, cellulose), capillaries, flat supports such
as glass fiber filters, glass surfaces, metal surfaces (steel, gold, silver, aluminum, copper and silicon), plastic materials
including multiwell plates or membranes (e.g., of polyethylene, polypropylene, polyamide, polyvinylidenedifluoride),
pins (e.g., arrays of pins suitable for combinatorial synthesis or analysis) or beads in pits of flat surfaces such as wafers
(e.g., silicon wafers), with or without filter plates.

[0168] Samples containing target nucleic acids can be transferred to solid supports by any of a variety of methods
known to those of skill in the art. For example, nucleic acid samples can be transferred to individual wells of a substrate,
e.g., a silicon chip, manually or using a pintool microdispenser apparatus as described herein. Alternatively, a
piezoelectric pipette apparatus can be used to transfer small nanoliter samples to a substrate, permitting the performance of
high throughput miniaturized diagnostics on a chip.

[0169] Immobilization can be accomplished, for example, based on hybridization between a capture nucleic acid
sequence, which has already been immobilized to the support, and a complementary nucleic acid sequence, which is
also contained within the nucleic acid molecule containing the nucleic acid sequence to be detected (FIGURE 1A). So
that hybridization between the complementary nucleic acid molecules is not hindered by the support, the capture nucleic
acid can include, e.g., a spacer region of at least about five nucleotides in length between the solid support and the
capture nucleic acid sequence. The duplex formed will be cleaved under the influence of the laser pulse and desorption
can be initiated. The solid support-bound nucleic acid molecule can be presented through natural oligoribo- or
oligodeoxyribonucleotide as well as analogs (e.g., thio-modified phosphodiester or phosphotriester backbone) or employing
oligonucleotide mimetics such as PNA analogs (see, e.g., Nielsen et al., Science 254:1497 (1991)), which render the
base sequence less susceptible to enzymatic degradation and thus stabilize the solid support-bound capture base sequence.

Linkers 

[0170] A target detection site can be directly linked to a solid support via a reversible or irreversible bond between
an appropriate functionality (L') on the target nucleic acid molecule (T) and an appropriate functionality (L) on the
capture molecule (FIGURE 1B). A reversible linkage can be such that it is cleaved under the conditions of mass
spectrometry (i.e., a photocleavable bond such as a charge transfer complex or a labile bond being formed between relatively
stable organic radicals).

[0171] Photocleavable linkers are linkers that are cleaved upon exposure to light (see, e.g., Goldmacher et al. (1992)
Bioconj. Chem. 3:104-107), thereby releasing the targeted agent upon exposure to light. Photocleavable linkers that
are cleaved upon exposure to light are known (see, e.g., Hazum et al. (1981) in Pept., Proc. Eur. Pept. Symp., 16th,
Brunfeldt, K (Ed), pp. 105-110, which describes the use of a nitrobenzyl group as a photocleavable protective group
for cysteine; Yen et al. (1989) Makromol. Chem. 190:69-82, which describes water soluble photocleavable copolymers,
including hydroxypropylmethacrylamide copolymer, glycine copolymer, fluorescein copolymer and methylrhodamine
copolymer; Goldmacher et al. (1992) Bioconj. Chem. 3:104-107, which describes a cross-linker and reagent that
undergoes photolytic degradation upon exposure to near UV light (350 nm); and Senter et al. (1985) Photochem. Photobiol.
42:231-237, which describes nitrobenzyloxycarbonyl chloride cross linking reagents that produce photocleavable
linkages), thereby releasing the targeted agent upon exposure to light. In preferred embodiments, the nucleic acid is
immobilized using the photocleavable linker moiety that is cleaved during mass spectrometry. Presently preferred
photocleavable linkers are set forth in the EXAMPLES.

[0172] Furthermore, the linkage can be formed with L' being a quaternary ammonium group, in which case, preferably,
the surface of the solid support carries negative charges which repel the negatively charged nucleic acid backbone
and thus facilitate the desorption required for analysis by a mass spectrometer. Desorption can occur either by the
heat created by the laser pulse and/or, depending on L', by specific absorption of laser energy which is in resonance
with the L' chromophore.

[0173] Thus, the L-L' chemistry can be of a type of disulfide bond (chemically cleavable, for example, by
mercaptoethanol or dithioerythritol), a biotin/streptavidin system, a heterobifunctional derivative of a trityl ether group (see,
e.g., Koster et al. (1990) "A Versatile Acid-Labile Linker for Modification of Synthetic Biomolecules," Tetrahedron Letters
31:7095) that can be cleaved under mildly acidic conditions as well as under conditions of mass spectrometry, a levulinyl
group cleavable under almost neutral conditions with a hydrazinium/acetate buffer, an arginine-arginine or lysine-lysine
bond cleavable by an endopeptidase enzyme like trypsin, a pyrophosphate bond cleavable by a pyrophosphatase,
or a ribonucleotide bond in between the oligodeoxynucleotide sequence, which can be cleaved, for example, by a
ribonuclease or alkali.

[0174] The functionalities, L and L', can also form a charge transfer complex and thereby form the temporary L-L'
linkage. Since in many cases the "charge-transfer band" can be determined by UV/vis spectrometry (see, e.g., Organic
Charge Transfer Complexes by R. Foster, Academic Press, 1969), the laser energy can be tuned to the corresponding
energy of the charge-transfer wavelength and, thus, a specific desorption off the solid support can be initiated. Those
skilled in the art will recognize that several combinations can serve this purpose and that the donor functionality can
be either on the solid support or coupled to the nucleic acid molecule to be detected or vice versa.

[0175] In yet another approach, a reversible L-L' linkage can be generated by homolytically forming relatively stable
radicals. Under the influence of the laser pulse, desorption (as discussed above) as well as ionization will take place
at the radical position. Those skilled in the art will recognize that other organic radicals can be selected and that, in
relation to the dissociation energies needed to homolytically cleave the bond between them, a corresponding laser
wavelength can be selected (see, e.g., Reactive Molecules by C. Wentrup, John Wiley & Sons, 1984).
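
The correspondence between a laser wavelength and a charge-transfer band or bond dissociation energy follows from the photon energy relation E = N_A·h·c/λ. The Python sketch below converts between wavelength in nanometres and molar photon energy in kJ/mol. It is a physical back-of-the-envelope aid only: the function names are hypothetical, and matching the photon energy to a dissociation energy is a simplifying assumption that ignores absorption cross-sections and thermal effects.

```python
# Exact SI values of the constants (2019 redefinition).
PLANCK_J_S = 6.62607015e-34        # Planck constant, J*s
LIGHT_SPEED_M_S = 2.99792458e8     # speed of light, m/s
AVOGADRO_PER_MOL = 6.02214076e23   # Avogadro constant, 1/mol


def photon_energy_kj_per_mol(wavelength_nm: float) -> float:
    """Molar photon energy (kJ/mol) for light of the given wavelength (nm)."""
    if wavelength_nm <= 0:
        raise ValueError("wavelength must be positive")
    joules_per_photon = PLANCK_J_S * LIGHT_SPEED_M_S / (wavelength_nm * 1e-9)
    return joules_per_photon * AVOGADRO_PER_MOL / 1000.0


def wavelength_nm_for_energy(energy_kj_per_mol: float) -> float:
    """Wavelength (nm) whose photons carry the given molar energy (kJ/mol)."""
    if energy_kj_per_mol <= 0:
        raise ValueError("energy must be positive")
    joules_per_photon = energy_kj_per_mol * 1000.0 / AVOGADRO_PER_MOL
    return PLANCK_J_S * LIGHT_SPEED_M_S / joules_per_photon * 1e9
```

With these constants, near-UV light at 350 nm corresponds to about 342 kJ/mol, so longer wavelengths address weaker bonds and shorter wavelengths stronger ones.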

[0176] An anchoring function L' can also be incorporated into a target capturing sequence (TCS) by using appropriate
primers during an amplification procedure, such as PCR (FIGURE 4), LCR (FIGURE 5) or transcription amplification
(FIGURE 6A).

[0177] When performing exonuclease sequencing using MALDI-TOF MS, a single stranded DNA molecule
immobilized via its 5'-end to a solid support is unilaterally degraded with a 3'-processive exonuclease and the molecular weight
of the degraded nucleotide is determined sequentially. Reverse Sanger sequencing reveals the nucleotide sequence
of the immobilized DNA. By adding a selectively cleavable linker, not only can the mass of the free nucleotides be
determined but also, upon removal of the nucleotides by washing, the mass of the remaining fragment can be detected
by MALDI-TOF upon cleaving the DNA from the solid support. Using selectively cleavable linkers, such as the
photocleavable and chemically cleavable linkers provided herein, this cleavage can be selected to occur during the ionization
and volatilizing steps of MALDI-TOF. The same rationale applies for a 5' immobilized strand of a double stranded DNA
that is degraded while in a duplex. Likewise, this also applies when using a 5'-processive exonuclease and the DNA
is immobilized through the 3'-end to the solid support.
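
The way a sequence is read from the masses of the remaining fragments can be sketched in Python. The example assumes a 5'-immobilized strand degraded from its 3'-end, so that each successive drop in fragment mass equals the mass of one removed nucleotide residue. The residue masses are standard average values, and the tolerance, function names and input format are assumptions made for illustration; this is not the procedure of the EXAMPLES.

```python
# Average residue masses (Da) of nucleotides within a DNA chain (assumed textbook values).
RESIDUE_MASS = {"A": 313.21, "C": 289.18, "G": 329.21, "T": 304.20}


def call_base(mass_drop: float, tolerance: float = 2.0) -> str:
    """Return the base whose residue mass is closest to the observed mass drop."""
    base, mass = min(RESIDUE_MASS.items(), key=lambda item: abs(item[1] - mass_drop))
    if abs(mass - mass_drop) > tolerance:
        raise ValueError(f"mass drop {mass_drop:.2f} Da matches no single nucleotide")
    return base


def read_ladder(fragment_masses: list[float], tolerance: float = 2.0) -> str:
    """Read the degraded region 5'->3' from masses of the remaining fragments.

    The heaviest fragment is the intact strand; each lighter fragment has lost
    one more nucleotide from the 3'-end.
    """
    masses = sorted(fragment_masses, reverse=True)
    removed = [
        call_base(heavier - lighter, tolerance)
        for heavier, lighter in zip(masses, masses[1:])
    ]
    # Nucleotides leave from the 3'-end first, so reverse to obtain 5'->3' order.
    return "".join(reversed(removed))
```

The smallest difference between two residue masses is about 9 Da (A versus T), which is why the default tolerance of 2 Da in this sketch still separates the four bases.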

[0178] As noted, at least three versions of immobilization are contemplated herein: 1) the target nucleic acid is
amplified or obtained (the target sequence or surrounding DNA sequence must be known to make primers to amplify or
isolate it); 2) the primer nucleic acid is immobilized to the solid support and the target nucleic acid is hybridized thereto
(this is for detecting the presence of or sequencing a target sequence in a sample); or 3) a double stranded DNA
(amplified or isolated) is immobilized through linkage to one predetermined strand, the DNA is denatured to eliminate
the duplex, and then a high concentration of a complementary primer or DNA with identity upstream from the target
site is added, so that strand displacement occurs and the primer is hybridized to the immobilized strand.

[0179] In the embodiments where the primer nucleic acid is immobilized on the solid support and the target nucleic
acid is hybridized thereto, the inclusion of the cleavable linker allows the primer DNA to be immobilized at the 5'-end
so that a free 3'-OH is available for nucleic acid synthesis (extension), and the sequence of the "hybridized" target DNA
can be determined because the hybridized template can be removed by denaturation and the extended DNA products
cleaved from the solid support for MALDI-TOF MS. Similarly for 3), the immobilized DNA strand can be elongated when
hybridized to the template and cleaved from the support. Thus, Sanger sequencing and primer oligo base extension
(PROBE), discussed below, extension reactions can be performed using an immobilized primer of a known, upstream
DNA sequence complementary to an invariable region of a target sequence. The nucleic acid from the person is
obtained and the DNA sequence of a variable region (deletion, insertion, missense mutation that causes genetic
predisposition or diseases, or the presence of viral/bacterial or fungal DNA) not only is detected, but the actual sequence
and position of the mutation is also determined.

[0180] In other cases, the target DNA must be immobilized and the primer annealed. This requires amplifying a larger
DNA based on known sequence and then sequencing the immobilized fragments (i.e., the extended fragments are
hybridized but not immobilized to the support as described above). In these cases, it is not desirable to include a linker
because the MALDI-TOF spectrum is of the hybridized DNA; it is not necessary to cleave the immobilized template.

[0181] Any linker known to those of skill in the art for immobilizing nucleic acids to solid supports may be used herein
to link the nucleic acid to a solid support. The preferred linkers herein are the selectively cleavable linkers, particularly
those exemplified herein. Other linkers include acid cleavable linkers, such as bismaleimidoethoxy propane, and
acid-labile trityl linkers.

[0182] Acid cleavable linkers, photocleavable and heat sensitive linkers may also be used, particularly where it may 
be necessary to cleave the targeted agent to permit it to be more readily accessible to reaction. Acid cleavable linkers 
include, but are not limited to, bismaleimidoethoxy propane; and adipic acid dihydrazide linkers (see, e.g., Fattom et
al. (1992) Infection & Immun. 60:584-589) and acid labile transferrin conjugates that contain a sufficient portion of
transferrin to permit entry into the intracellular transferrin cycling pathway (see, e.g., Welhoner et al. (1991) J. Biol.
Chem. 266:4309-4314).

Photocleavable Linkers 

[0183] Photocleavable linkers are provided. In particular, photocleavable linkers as their phosphoramidite derivatives
are provided for use in solid phase synthesis of oligonucleotides. The linkers contain o-nitrobenzyl moieties and
phosphate linkages which allow for complete photolytic cleavage of the conjugates within minutes upon UV irradiation. The
UV wavelengths used are selected so that the irradiation will not damage the oligonucleotides and are preferably
about 350-380 nm, more preferably 365 nm. The photocleavable linkers provided herein possess coupling
efficiency comparable to that of commonly used phosphoramidite monomers (see, Sinha et al. (1983) Tetrahedron Lett.
24:5843-5846; Sinha et al. (1984) Nucleic Acids Res. 12:4539-4557; Beaucage et al. (1993) Tetrahedron 49:6123-6194;
and Matteucci et al. (1981) J. Am. Chem. Soc. 103:3185-3191).
[0184] In one embodiment, the photocleavable linkers have formula I: 



(I) 


where R20 is ω-(4,4'-dimethoxytrityloxy)alkyl or ω-hydroxyalkyl; R21 is selected from hydrogen, alkyl, aryl,
alkoxycarbonyl, aryloxycarbonyl and carboxy; R22 is hydrogen or (dialkylamino)(ω-cyanoalkoxy)P-; t is 0-3; and R50 is alkyl,
alkoxy, aryl or aryloxy.

[0185] In a preferred embodiment, the photocleavable linkers have formula II:



(II)


where R20 is ω-(4,4'-dimethoxytrityloxy)alkyl, ω-hydroxyalkyl or alkyl; R21 is selected from hydrogen, alkyl, aryl,
alkoxycarbonyl, aryloxycarbonyl and carboxy; R22 is hydrogen or (dialkylamino)(ω-cyanoalkoxy)P-; and X20 is hydrogen,
alkyl or OR20.

[0186] In particularly preferred embodiments, R20 is 3-(4,4'-dimethoxytrityloxy)propyl, 3-hydroxypropyl or methyl;
R21 is selected from hydrogen, methyl and carboxy; R22 is hydrogen or (diisopropylamino)(2-cyanoethoxy)P-; and X20
is hydrogen, methyl or OR20. In a more preferred embodiment, R20 is 3-(4,4'-dimethoxytrityloxy)propyl; R21 is methyl;
R22 is (diisopropylamino)(2-cyanoethoxy)P-; and X20 is hydrogen. In another more preferred embodiment, R20 is
methyl; R21 is methyl; R22 is (diisopropylamino)(2-cyanoethoxy)P-; and X20 is 3-(4,4'-dimethoxytrityloxy)propoxy.
[0187] In another embodiment, the photocleavable linkers have formula III: 


where R23 is hydrogen or (dialkylamino)(ω-cyanoalkoxy)P-; and R24 is selected from ω-hydroxyalkoxy,
ω-(4,4'-dimethoxytrityloxy)alkoxy, ω-hydroxyalkyl and ω-(4,4'-dimethoxytrityloxy)alkyl, and is unsubstituted or
substituted on the alkyl or alkoxy chain with one or more alkyl groups; r and s are each independently 0-4; and R50 is
alkyl, alkoxy, aryl or aryloxy. In certain embodiments, R24 is ω-hydroxyalkyl or ω-(4,4'-dimethoxytrityloxy)alkyl, and is
substituted on the alkyl chain with a methyl group.

[0188] In preferred embodiments, R23 is hydrogen or (diisopropylamino)(2-cyanoethoxy)P-; and R24 is selected from
3-hydroxypropoxy, 3-(4,4'-dimethoxytrityloxy)propoxy, 4-hydroxybutyl, 3-hydroxy-1-propyl, 1-hydroxy-2-propyl,
3-hydroxy-2-methyl-1-propyl, 2-hydroxyethyl, hydroxymethyl, 4-(4,4'-dimethoxytrityloxy)butyl,
3-(4,4'-dimethoxytrityloxy)-1-propyl, 2-(4,4'-dimethoxytrityloxy)ethyl, 1-(4,4'-dimethoxytrityloxy)-2-propyl,
3-(4,4'-dimethoxytrityloxy)-2-methyl-1-propyl and 4,4'-dimethoxytrityloxymethyl.

[0189] In more preferred embodiments, R23 is (diisopropylamino)(2-cyanoethoxy)P-; r and s are 0; and R24 is
selected from 3-(4,4'-dimethoxytrityloxy)propoxy, 4-(4,4'-dimethoxytrityloxy)butyl, 3-(4,4'-dimethoxytrityloxy)propyl,
2-(4,4'-dimethoxytrityloxy)ethyl, 1-(4,4'-dimethoxytrityloxy)-2-propyl, 3-(4,4'-dimethoxytrityloxy)-2-methyl-1-propyl and
4,4'-dimethoxytrityloxymethyl. R24 is most preferably 3-(4,4'-dimethoxytrityloxy)propoxy.

Preparation of the photocleavable linkers 

A. Preparation of photocleavable linkers of formulae I or II

[0190] Photocleavable linkers of formulae I or II may be prepared by the methods described below, by minor modi- 
fication of the methods by choosing the appropriate starting materials or by any other methods known to those of skill 
in the art. Detailed procedures for the synthesis of photocleavable linkers of formula II are provided in the Examples. 
[0191] In the photocleavable linkers of formula II where X20 is hydrogen, the linkers may be prepared in the following
manner. Alkylation of 5-hydroxy-2-nitrobenzaldehyde with an ω-hydroxyalkyl halide, e.g., 3-hydroxypropyl bromide,
followed by protection of the resulting alcohol as, e.g., a silyl ether, provides a 5-(ω-silyloxyalkoxy)-2-nitrobenzaldehyde.
Addition of an organometallic to the aldehyde affords a benzylic alcohol. Organometallics which may be used include
trialkylaluminums (for linkers where R21 is alkyl), such as trimethylaluminum, borohydrides (for linkers where R21 is
hydrogen), such as sodium borohydride, or metal cyanides (for linkers where R21 is carboxy or alkoxycarbonyl), such
as potassium cyanide. In the case of the metal cyanides, the product of the reaction, a cyanohydrin, would then be
hydrolyzed under either acidic or basic conditions in the presence of either water or an alcohol to afford the compounds
of interest.

[0192] The silyl group of the side chain of the resulting benzylic alcohols may then be exchanged for a
4,4'-dimethoxytrityl group by desilylation with, e.g., tetrabutylammonium fluoride, to give the corresponding alcohol,
followed by reaction with 4,4'-dimethoxytrityl chloride. Reaction with, e.g., 2-cyanoethyl
diisopropylchlorophosphoramidite affords the linkers where R22 is (dialkylamino)(ω-cyanoalkoxy)P-.

[0193] A specific example of a synthesis of a photocleavable linker of formula II is shown in the following scheme,
which also demonstrates use of the linker in oligonucleotide synthesis. This scheme is intended to be illustrative only
and in no way limits the scope of the invention. Experimental details of these synthetic transformations are provided
in the Examples.
[0194] For synthesis of the linkers of formula II where X20 is OR20, 3,4-dihydroxyacetophenone is protected selectively
at the 4-hydroxyl by reaction with, e.g., potassium carbonate and a silyl chloride. Benzoate esters, propiophenones,
butyrophenones, etc. may be used in place of the acetophenone. The resulting 4-silyloxy-3-hydroxyacetophenone is
then alkylated at the 3-hydroxyl with an alkyl halide (for linkers where R20 is alkyl) and desilylated with, e.g.,
tetrabutylammonium fluoride to afford a 3-alkoxy-4-hydroxyacetophenone. This compound is then alkylated at the
4-hydroxyl by reaction with an ω-hydroxyalkyl halide, e.g., 3-hydroxypropyl bromide, to give a
4-(ω-hydroxyalkoxy)-3-alkoxyacetophenone. The side chain alcohol is then protected as an ester, e.g., an acetate. This
compound is then nitrated at the 5-position with, e.g., concentrated nitric acid to provide the corresponding
2-nitroacetophenones. Saponification of the side chain ester with, e.g., potassium carbonate, and reduction of the
ketone with, e.g., sodium borohydride, in either order gives a 2-nitro-4-(ω-hydroxyalkoxy)-5-alkoxybenzylic alcohol.

[0195] Selective protection of the side chain alcohol as the corresponding 4,4'-dimethoxytrityl ether is then
accomplished by reaction with 4,4'-dimethoxytrityl chloride. Further reaction with, e.g., 2-cyanoethyl
diisopropylchlorophosphoramidite affords the linkers where R22 is (dialkylamino)(ω-cyanoalkoxy)P-.

[0196] A specific example of the synthesis of a photocleavable linker of formula II is shown in the following scheme.
This scheme is intended to be illustrative only and in no way limits the scope of the invention. Detailed experimental
procedures for the transformations shown are found in the Examples.

B. Preparation of photocleavable linkers of formula III

[0197] Photocleavable linkers of formula III may be prepared by the methods described below, by minor modification
of the methods by choosing appropriate starting materials, or by other methods known to those of skill in the art.
[0198] In general, photocleavable linkers of formula III are prepared from ω-hydroxyalkyl- or alkoxyaryl compounds,
in particular ω-hydroxy-alkyl or alkoxy-benzenes. These compounds are commercially available, or may be prepared
from an ω-hydroxyalkyl halide (e.g., 3-hydroxypropyl bromide) and either phenyllithium (for the
ω-hydroxyalkylbenzenes) or phenol (for the ω-hydroxyalkoxybenzenes). Acylation of the ω-hydroxyl group (e.g., as an
acetate ester) followed by Friedel-Crafts acylation of the aromatic ring with 2-nitrobenzoyl chloride provides a
4-(ω-acetoxy-alkyl or alkoxy)-2-nitrobenzophenone. Reduction of the ketone with, e.g., sodium borohydride, and
saponification of the side chain ester are performed in either order to afford a 2-nitrophenyl-4-(hydroxy-alkyl or
alkoxy)phenylmethanol. Protection of the terminal hydroxyl group as the corresponding 4,4'-dimethoxytrityl ether is
achieved by reaction with 4,4'-dimethoxytrityl chloride. The benzylic hydroxyl group is then reacted with, e.g.,
2-cyanoethyl diisopropylchlorophosphoramidite to afford linkers of formula III where R23 is
(dialkylamino)(ω-cyanoalkoxy)P-.

[0199] Other photocleavable linkers of formula III may be prepared by substituting 2-phenyl-1-propanol or
2-phenylmethyl-1-propanol for the ω-hydroxy-alkyl or alkoxy-benzenes in the above synthesis. These compounds are
commercially available, but may also be prepared by reaction of, e.g., phenylmagnesium bromide or benzylmagnesium
bromide, with the requisite oxirane (i.e., propylene oxide) in the presence of catalytic cuprous ion.

Chemically cleavable linkers

[0200] A variety of chemically cleavable linkers may be used to introduce a cleavable bond between the immobilized
nucleic acid and the solid support. Acid-labile linkers are presently preferred chemically cleavable linkers for mass
spectrometry, especially MALDI-TOF MS, because the acid labile bond is cleaved during conditioning of the nucleic
acid upon addition of the 3-HPA matrix solution. The acid labile bond can be introduced as a separate linker group,
e.g., the acid labile trityl groups (see Figure 68; Example 16) or may be incorporated in a synthetic nucleic acid linker
by introducing one or more silyl internucleoside bridges using diisopropylsilyl, thereby forming diisopropylsilyl-linked
oligonucleotide analogs. The diisopropylsilyl bridge replaces the phosphodiester bond in the DNA backbone and under
mildly acidic conditions, such as 1.5% trifluoroacetic acid (TFA) or 3-HPA/1% TFA MALDI-TOF matrix solution, results
in the introduction of one or more intra-strand breaks in the DNA molecule. Methods for the preparation of
diisopropylsilyl-linked oligonucleotide precursors and analogs are known to those of skill in the art (see e.g., Saha et
al. (1993) J. Org. Chem. 58:7827-7831). These oligonucleotide analogs may be readily prepared using solid state
oligonucleotide synthesis methods using diisopropylsilyl derivatized deoxyribonucleosides.

Nucleic acid conditioning

[0201] Prior to mass spectrometric analysis, it may be useful to "condition" nucleic acid molecules, for example to
decrease the laser energy required for volatilization and/or to minimize fragmentation. Conditioning is preferably
performed while a target detection site is immobilized. An example of conditioning is modification of the phosphodiester
backbone of the nucleic acid molecule (e.g., cation exchange), which can be useful for eliminating peak broadening
due to a heterogeneity in the cations bound per nucleotide unit. By contacting a nucleic acid molecule with an alkylating
agent such as alkyl iodide, iodoacetamide, β-iodoethanol, or 2,3-epoxy-1-propanol, the monothio phosphodiester bonds
of a nucleic acid molecule can be transformed into phosphotriester bonds. Likewise, phosphodiester bonds may be
transformed to uncharged derivatives employing trialkylsilyl chlorides. Further conditioning involves incorporating
nucleotides that reduce sensitivity for depurination (fragmentation during MS), e.g., a purine analog such as N7- or
N9-deazapurine nucleotides, or RNA building blocks, or using oligonucleotide triesters, or incorporating
phosphorothioate functions which are alkylated, or employing oligonucleotide mimetics such as PNA.

Multiplex reactions 

[0202] For certain applications, it may be useful to simultaneously detect more than one (mutated) locus on a particular
captured nucleic acid fragment (on one spot of an array) or it may be useful to perform parallel processing by using
oligonucleotide or oligonucleotide mimetic arrays on various solid supports. "Multiplexing" can be achieved by several
different methodologies. For example, several mutations can be simultaneously detected on one target sequence by
employing corresponding detector (probe) molecules (e.g., oligonucleotides or oligonucleotide mimetics). The molecular
weight differences between the detector oligonucleotides D1, D2 and D3 must be large enough so that simultaneous
detection (multiplexing) is possible. This can be achieved either by the sequence itself (composition or length)
or by the introduction of mass-modifying functionalities M1-M3 into the detector oligonucleotide (see FIGURE 2).
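
The requirement that detector oligonucleotides differ sufficiently in mass can be illustrated with a minimal sketch. It assumes the nucleotide unit masses quoted later in this description (313.06, 304.05, 289.05 and 329.05 Da for A, T, C and G), ignores end groups and counter-ions, and uses a hypothetical minimum separation `min_sep_da` standing in for the resolution of the instrument; the function names are illustrative only.

```python
from itertools import combinations

# Nucleotide unit masses (Da) as quoted in this description for A, T, C, G.
UNIT_MASS = {"A": 313.06, "T": 304.05, "C": 289.05, "G": 329.05}


def oligo_mass(seq, modifier_da=0.0):
    """Approximate mass of a detector oligonucleotide: sum of unit masses
    plus an optional mass-modifying increment (end groups are ignored)."""
    return sum(UNIT_MASS[base] for base in seq) + modifier_da


def resolvable(masses, min_sep_da):
    """True if every pair of masses differs by at least min_sep_da."""
    return all(abs(x - y) >= min_sep_da for x, y in combinations(masses, 2))


# Three detectors with the same (hypothetical) sequence, distinguished only
# by mass-modifying increments of 0, 45 and 89 mass units.
base_mass = oligo_mass("ACGTACGTACGTACGTAC")
detectors = [base_mass, base_mass + 45, base_mass + 89]
can_multiplex = resolvable(detectors, min_sep_da=20.0)
```

Under these assumptions the three detectors are at least 44 mass units apart, so `can_multiplex` is true for any required separation up to that value.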

Mass modification of nucleic acids

[0203] Mass modifying moieties can be attached, for instance, to either the 5'-end of the oligonucleotide (M1), to the
nucleobase (or bases) (M2, M7), to the phosphate backbone (M3), and to the 2'-position of the nucleoside (nucleosides)
(M4, M6) and/or to the terminal 3'-position (M5). Examples of mass modifying moieties include, for example, a halogen,
an azido, or of the type, XR, wherein X is a linking group and R is a mass-modifying functionality. The mass-modifying
functionality can thus be used to introduce defined mass increments into the oligonucleotide molecule.
[0204] The mass-modifying functionality can be located at different positions within the nucleotide moiety (see, e.g.,
U.S. Patent No. 5,547,835 and International PCT application No. WO 94/21822). For example, the mass-modifying
moiety, M, can be attached either to the nucleobase, M2 (in case of the c7-deazanucleosides also to C-7, M7), to the
triphosphate group at the alpha phosphate, M3, or to the 2'-position of the sugar ring of the nucleoside triphosphate,
M4 and M6. Modifications introduced at the phosphodiester bond (M4), such as with alpha-thio nucleoside
triphosphates, have the advantage that these modifications do not interfere with accurate Watson-Crick base-pairing and
additionally allow for the one-step post-synthetic site-specific modification of the complete nucleic acid molecule, e.g.,
via alkylation reactions (see, e.g., Nakamaye et al. (1988) Nucl. Acids Res. 16:9947-59). Particularly preferred
mass-modifying functionalities are boron-modified nucleic acids since they are better incorporated into nucleic acids by
polymerases (see, e.g., Porter et al. (1995) Biochemistry 34:11963-11969; Hasan et al. (1996) Nucleic Acids Res.
24:2150-2157; Li et al. (1995) Nucl. Acids Res. 23:4495-4501).

[0205] Furthermore, the mass-modifying functionality can be added so as to affect chain termination, such as by
attaching it to the 3'-position of the sugar ring in the nucleoside triphosphate, M5. For those skilled in the art, it is clear
that many combinations can be used in the methods provided herein. In the same way, those skilled in the art will
recognize that chain-elongating nucleoside triphosphates can also be mass-modified in a similar fashion with numerous
variations and combinations in functionality and attachment positions.

[0206] Without being bound to any particular theory, the mass-modification, M, can be introduced for X in XR as well
as using oligo/polyethylene glycol derivatives for R. The mass-modifying increment in this case is 44, i.e., five different
mass-modified species can be generated by just changing m from 0 to 4, thus adding mass units of 45 (m = 0), 89 (m =
1), 133 (m = 2), 177 (m = 3) and 221 (m = 4) to the nucleic acid molecule (e.g., detector oligonucleotide (D) or the
nucleoside triphosphates (FIGURE 6(C)), respectively). The oligo/polyethylene glycols can also be monoalkylated by
a lower alkyl such as methyl, ethyl, propyl, isopropyl, t-butyl and the like. A selection of linking functionalities, X, are
also illustrated. Other chemistries can be used in the mass-modified compounds (see, e.g., those described in
Oligonucleotides and Analogues, A Practical Approach, F. Eckstein, editor, IRL Press, Oxford, 1991).
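
The series of oligo/polyethylene glycol mass increments given above follows a simple arithmetic progression. The following minimal sketch reproduces it; the closed form 45 + 44·m is inferred from the listed values rather than stated in the text, and the function name is illustrative only.

```python
def peg_mass_increment(m):
    """Mass units added by an oligo/polyethylene glycol modifier with m
    additional repeat units: 45 for m = 0, plus 44 per repeat unit."""
    if m < 0:
        raise ValueError("m must be non-negative")
    return 45 + 44 * m


# The five species obtained by changing m from 0 to 4.
peg_increments = [peg_mass_increment(m) for m in range(5)]
print(peg_increments)  # [45, 89, 133, 177, 221]
```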
[0207] In yet another embodiment, various mass-modifying functionalities, R, other than oligo/polyethylene glycols,
can be selected and attached via appropriate linking chemistries, X. A simple mass-modification can be achieved by
substituting H for halogens like F, Cl, Br and/or I, or pseudohalogens such as CN, SCN, NCS, or by using different
alkyl, aryl or aralkyl moieties such as methyl, ethyl, propyl, isopropyl, t-butyl, hexyl, phenyl, substituted phenyl, benzyl,
or functional groups such as CH2F, CHF2, CF3, Si(CH3)3, Si(CH3)2(C2H5), Si(CH3)(C2H5)2, Si(C2H5)3. Yet another
mass-modification can be obtained by attaching homo- or heteropeptides through the nucleic acid molecule (e.g.,
detector (D)) or nucleoside triphosphates. One example, useful in generating mass-modified species with a mass
increment of 57, is the attachment of oligoglycines, e.g., mass-modifications of 74 (r = 1, m = 0), 131 (r = 1, m = 1), 188
(r = 1, m = 2), 245 (r = 1, m = 3) are achieved. Simple oligoamides also can be used, e.g., mass-modifications of 74
(r = 1, m = 0), 88 (r = 2, m = 0), 102 (r = 3, m = 0), 116 (r = 4, m = 0), etc. are obtainable. Variations in addition to those
set forth herein will be apparent to the skilled artisan.
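
The oligoglycine and oligoamide series listed above can be reproduced in the same way. In this minimal sketch the two closed forms are inferred from the listed values only (74 + 57·m for r = 1, and 74 + 14·(r - 1) for m = 0); how r and m combine when both are varied is not stated in the text and is not modeled here. The function names are illustrative only.

```python
def oligoglycine_increment(m):
    """Mass-modification for an oligoglycine with r = 1: 74 for m = 0,
    plus 57 for each additional glycine unit."""
    return 74 + 57 * m


def oligoamide_increment(r):
    """Mass-modification for a simple oligoamide with m = 0: 74 for r = 1,
    plus 14 for each increase of r by one."""
    return 74 + 14 * (r - 1)


glycine_series = [oligoglycine_increment(m) for m in range(4)]
amide_series = [oligoamide_increment(r) for r in range(1, 5)]
print(glycine_series)  # [74, 131, 188, 245]
print(amide_series)    # [74, 88, 102, 116]
```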

[0208] Different mass-modified detector oligonucleotides can be used to detect all possible variants/mutants
simultaneously (FIGURE 6B). Alternatively, all four base permutations at the site of a mutation can be detected
by designing and positioning a detector oligonucleotide, so that it serves as a primer for a DNA/RNA polymerase with
varying combinations of elongating and terminating nucleoside triphosphates (FIGURE 6C). For example, mass
modifications also can be incorporated during the amplification process.

[0209] FIGURE 3 shows a different multiplex detection format, in which differentiation is accomplished by employing
different specific capture sequences which are position-specifically immobilized on a flat surface (e.g., a 'chip array').
If different target sequences T1-Tn are present, their target capture sites TCS1-TCSn will specifically interact with
complementary immobilized capture sequences C1-Cn. Detection is achieved by employing appropriately mass
differentiated detector oligonucleotides D1-Dn, which carry mass modifying functionalities M1-Mn.

Mass spectrometric methods for sequencing DNA

[0210] Amenable mass spectrometric formats for use herein include the ionization (I) techniques, such as matrix
assisted laser desorption ionization (MALDI), electrospray (ESI) (e.g., continuous or pulsed), and related methods (e.g.,
Ionspray, Thermospray, Fast Atomic Bombardment), and massive cluster impact (MCI); these ion sources can be
matched with detection formats including linear or reflectron (with linear or non-linear fields) time-of-flight (TOF), single or
multiple magnetic sector, Fourier transform ion cyclotron resonance (FTICR), ion trap, or combinations of these to give 
a hybrid detector ( e.g. , ion trap - time of flight). For ionization, numerous matrix/wavelength combinations including 
frozen analyte preparation (MALDI) or solvent combinations (ESI) can be employed. 

[0211] Since a normal DNA molecule includes four nucleotide units (A, T, C, G), and the mass of each of these is
unique (monoisotopic masses 313.06, 304.05, 289.05, 329.05 Da, respectively), an accurate mass determination can
define or constrain the possible base compositions of that DNA. Only above 4900 Da does each unit molecular weight
have at least one allowable composition; among all 5-mers there is only one non-unique nominal molecular weight,
among 8-mers, 20. For these and larger oligonucleotides, such mass overlaps can be resolved with the ~1/10^5 (~10
part per million, ppm) mass accuracy available with high resolution FTICR MS. For the 25-mer A5T20, the 20
composition degeneracies when measured at ±0.5 Da are reduced to three (A5T20, T4C12G9, AT3C4G16) when measured
with 2 ppm accuracy. Given composition constraints (e.g., the presence or absence of one of the four bases in the
strand) can reduce this further (see below).
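
How a mass tolerance constrains base composition can be illustrated with a minimal sketch that enumerates every (A, T, C, G) count whose summed unit mass falls within a given window. It assumes only the four two-decimal unit masses quoted above, ignores end groups, and is not intended to reproduce the exact degeneracy counts stated in the text, which depend on more precise masses. The function name and the tolerance values are illustrative only.

```python
# Nucleotide unit masses (Da) as quoted in the text for A, T, C, G.
UNIT_MASS = {"A": 313.06, "T": 304.05, "C": 289.05, "G": 329.05}


def compositions_within(target_da, tol_da):
    """Return all (A, T, C, G) counts whose summed unit mass lies within
    tol_da of target_da (end groups are ignored)."""
    m_a, m_t, m_c, m_g = (UNIT_MASS[b] for b in "ATCG")
    hits = []
    for a in range(int(target_da // m_a) + 2):
        for t in range(int(target_da // m_t) + 2):
            for c in range(int(target_da // m_c) + 2):
                partial = a * m_a + t * m_t + c * m_c
                if partial > target_da + tol_da:
                    break  # adding more C can only overshoot further
                # Only G counts near the remaining mass can match.
                g_guess = round((target_da - partial) / m_g)
                for g in (g_guess - 1, g_guess, g_guess + 1):
                    if g >= 0 and abs(partial + g * m_g - target_da) <= tol_da:
                        hits.append((a, t, c, g))
    return hits


# Summed unit mass of A5T20 using the values above.
target = 5 * UNIT_MASS["A"] + 20 * UNIT_MASS["T"]
wide = compositions_within(target, tol_da=0.5)    # +/- 0.5 Da window
narrow = compositions_within(target, tol_da=0.01)  # much tighter window
```

With these unit masses, the three compositions named in the text all fall inside the ±0.5 Da window, and tightening the window removes candidates, which is the effect the paragraph describes.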

[0212] Medium resolution instrumentation, including but not exclusively curved field reflectron or delayed extraction
time-of-flight MS instruments, can also result in improved DNA detection for sequencing or diagnostics. Either of these
is capable of detecting a 9 Da (Δm (A-T)) shift in >30-mer strands generated from, for example, primer oligo base
extension (PROBE), or competitive oligonucleotide single base extension (COSBE), sequencing, or direct detection
of small amplified products.

BiomassScan 

[0213] In this embodiment, exemplified in Example 33, two single stranded nucleic acids are individually immobilized
to solid supports. One support contains a nucleic acid encoding the wild type sequence whereas the other support
contains a nucleic acid encoding a mutant target sequence. Total human genomic DNA is digested with one or more
restriction endonuclease enzymes, resulting in the production of small fragments of double stranded genomic DNA
(10-1,000 bp). The digested DNA is incubated with the immobilized single stranded nucleic acids and the sample is
heated to denature the DNA duplex. The immobilized nucleic acid competes with the other genomic DNA strand for
the complementary DNA strand and, under the appropriate conditions, a portion of the complementary DNA strand
hybridizes to the immobilized nucleic acid, resulting in a strand displacement. By using high stringency washing
conditions, the two nucleic acids will remain as a DNA duplex only if there is exact identity between the immobilized
nucleic acid and the genomic DNA strand. The DNA that remains hybridized to the immobilized nucleic acid is analyzed by
mass spectrometry, and detection of a signal in the mass spectrum of the appropriate mass is diagnostic for the wild
type or mutant allele. In this manner, total genomic DNA can be isolated from a biological sample and screened for
the presence or absence of certain mutations. By immobilizing a variety of single stranded nucleic acids in an array
format, a panel of mutations may be simultaneously screened for a number of genetic loci (i.e., multiplexing).
[0214] In addition, using less stringent washing conditions, the hybridized DNA strand may be analyzed by mass
spectrometry for changes in the mass resulting from a deletion or insertion within the targeted restriction endonuclease
fragment.

Primer oligonucleotide base extension 

[0215] As described in detail in the following Example 11, the primer oligo base extension (PROBE) method combined
with mass spectrometry identifies the exact number of repeat units (i.e., the number of nucleotides in homogenous
stretches) as well as second site mutations within a polymorphic region, which are otherwise only detectable by
sequencing. Thus, the PROBE technique increases the total number of detectable alleles at a distinct genomic site,
leading to a higher polymorphism information content (PIC) and yielding a far more definitive identification in, for
instance, statistics-based analyses in paternity or forensics applications.

[0216] The method is based on the extension of a detection primer that anneals adjacent to a variable number
tandem repeat (VNTR) or a polymorphic mononucleotide stretch using a DNA polymerase in the presence of a mixture
of deoxyNTPs and those dideoxyNTPs that are not present in the deoxy form. The resulting products are evaluated
and resolved by MALDI-TOF mass spectrometry without further labeling of the DNA. In a simulated routine application
with 28 unrelated individuals, the mass error of this procedure using external calibration was in the worst case 0.38%
(56-mer), which is comparable to approximately 0.1 base accuracy; routine standard mass deviations are in the range
of 0.1% (0.03 bases). Such accuracy with conventional electrophoretic methods is not realistic, underscoring the value
of PROBE and mass spectrometry in forensic medicine and paternity testing.

[0217] The ultra-high resolution of Fourier Transform mass spectrometry makes possible the simultaneous meas- 
urement of all reactions of a Sanger or Maxam Gilbert sequencing experiment, since the sequence may be read from 
mass differences instead of base counting from 4 tubes. 

[0218] Additionally, the mass differences between adjacent bases generated from unilateral degradation in a
stepwise manner by an exonuclease can be used to read the entire sequence of fragments generated. Whereas UV or
fluorescent measurements will not discriminate mixtures of the nucleoside/nucleotide which are generated when the
exonuclease enzyme gets out of phase, this is no problem with mass spectrometry since the resolving power in
differentiating between the molecular mass of dA, dT, dG and dC is more than sufficient. The mass of the adjacent bases
(i.e., nucleotides) can be determined, for example, using Fast Atomic Bombardment (FAB) or Electrospray Ionization
(ESI) mass spectrometry.
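
Reading a sequence from the mass differences between adjacent members of a degradation ladder can be sketched as follows. This is a minimal illustration, assuming the nucleotide unit masses quoted earlier in this description, an idealized ladder with no missing members, and a hypothetical matching tolerance `tol_da`; bases are reported in order from the lightest to the heaviest fragment, and a difference that matches no nucleotide is reported as "N". The function names are illustrative only.

```python
# Nucleotide unit masses (Da) as quoted in this description for A, T, C, G.
UNIT_MASS = {"A": 313.06, "T": 304.05, "C": 289.05, "G": 329.05}


def ladder_masses(seq):
    """Idealized ladder: summed unit mass of each prefix of seq
    (lengths 1..n), ignoring end groups."""
    masses, total = [], 0.0
    for base in seq:
        total += UNIT_MASS[base]
        masses.append(total)
    return masses


def read_ladder(masses, tol_da=1.0):
    """Assign a base to each mass difference between adjacent ladder
    members, ordered from lightest to heaviest fragment."""
    ordered = sorted(masses)
    calls = []
    for lighter, heavier in zip(ordered, ordered[1:]):
        delta = heavier - lighter
        base, unit = min(UNIT_MASS.items(), key=lambda kv: abs(kv[1] - delta))
        calls.append(base if abs(unit - delta) <= tol_da else "N")
    return "".join(calls)


# The lightest fragment anchors the ladder, so the read starts at base 2.
read = read_ladder(ladder_masses("GATTC"))
print(read)  # ATTC
```

Because the smallest difference between any two unit masses used here is about 9 Da (A versus T), a tolerance of 1 Da leaves each call unambiguous in this idealized case.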

[0219] New mutation screening over an entire amplified product can be achieved by searching for mass shifted 
fragments generated in an endonuclease digestion as described in detail in the following Examples 4 and 12. 
[0220] Partial sequence information obtained from tandem mass spectrometry (MS^n) can place composition
constraints as described in the preceding paragraph. For the 25-mer above, generation of two fragment ions formed by
collisionally activated dissociation (CAD) which differ by 313 Da discounts T4C12G9, which contains no A nucleotides;
confirming more than a single A eliminates AT3C4G16 as a possible composition.

[0221] MS^n can also be used to determine full or partial sequences of larger DNAs; this can be used to detect,
locate, and identify new mutations in a given gene region. Enzymatic digest products whose masses are correct need
not be further analyzed; those with mass shifts could be isolated in real time from the complex mixture in the mass
spectrometer and partially sequenced to locate the new mutation.

[0222] Table I describes the mutation/polymorphism detection tests that have been developed.

Table I

Mutation/Polymorphism Detection Tests

Clinical Association                     Gene                 Mutation/Polymorphism

Cystic Fibrosis                          CFTR                 38 disease causing mutations in 14 exons/introns

Heart Disease (Cholesterol Metabolism)   Apo E                112R, 112C, 158R, 158C
                                         Apo A-IV             347S, 347T, 360H, 360Q
                                         Apo B-100            3500Q, 3500R

Thyroid Cancer                           RET proto-oncogene   C634W, C634T, C634R, C634S, C634F

Sickle Cell Anemia/Thalassemia           beta-globin          Sickle cell anemia S and C; 45 thalassemia alleles

HIV Susceptibility                       CKR-5                32bp deletion

Breast Cancer Susceptibility             BRCA-2               2bp (AG) deletion in exon 2

Thrombosis                               Factor V             R506Q

Arteriosclerosis                         GpIIIa               L33P
                                         E-selectin           S128R

Hypertension                             ACE                  I/D polymorphism


Detection of mutations 


Diagnosis of genetic diseases 

[0223] The mass spectrometric processes described above can be used, for example, to diagnose any of the more
than 3000 genetic diseases currently known (e.g., hemophilias, thalassemias, Duchenne Muscular Dystrophy (DMD),
Huntington's Disease (HD), Alzheimer's Disease and Cystic Fibrosis (CF)) or yet to be identified.

[0224] The following Examples provide a mass spectrometric method for detecting a mutation (ΔF508) of the cystic
fibrosis transmembrane conductance regulator gene (CFTR), which differs by only three base pairs (900 daltons) from
the wild type CFTR gene. As described further in Example 3, the detection is based on a single-tube, competitive
oligonucleotide single base extension (COSBE) reaction using a pair of primers with the 3'-terminal base
complementary to either the normal or mutant allele. Upon hybridization and addition of a polymerase and the nucleoside
triphosphate one base downstream, only those primers properly annealed (i.e., no 3'-terminal mismatch) are extended;
products are resolved by molecular weight shifts as determined by matrix assisted laser desorption ionization time-of-flight
mass spectrometry. For the cystic fibrosis ΔF508 polymorphism, 28-mer 'normal' (N) and 30-mer 'mutant' (M) primers
generate 29- and 31-mers for N and M homozygotes, respectively, and both for heterozygotes. Since primer and product
molecular weights are relatively low (< 10 kDa) and the mass difference between these is at least that of a single
~300 Da nucleotide unit, low resolution instrumentation is suitable for such measurements.
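
The genotype assignment described above can be summarized in a minimal sketch. It assumes that peaks in the mass spectrum have already been assigned to product lengths in nucleotides, which is a hypothetical preprocessing step not described here; the function name and the returned labels are illustrative only.

```python
def cosbe_genotype(product_lengths):
    """Map the extension-product lengths (in nucleotides) observed in one
    tube to a genotype: the 28-mer normal primer yields a 29-mer and the
    30-mer mutant primer yields a 31-mer."""
    has_normal = 29 in product_lengths
    has_mutant = 31 in product_lengths
    if has_normal and has_mutant:
        return "heterozygote (N/M)"
    if has_normal:
        return "normal homozygote (N/N)"
    if has_mutant:
        return "mutant homozygote (M/M)"
    return "no call"


# Unextended primers (28, 30) may also be present and do not affect the call.
call = cosbe_genotype({28, 29, 30, 31})
print(call)  # heterozygote (N/M)
```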

[0225] Thermosequence cycle sequencing, as further described in Example 11, is also useful for detecting a genetic
disease.

[0226] In addition to mutated genes, which result in genetic disease, certain birth defects are the result of
chromosomal abnormalities such as Trisomy 21 (Down's Syndrome), Trisomy 13 (Patau Syndrome), Trisomy 18 (Edward's
Syndrome), Monosomy X (Turner's Syndrome) and other sex chromosome aneuploidies such as Klinefelter's
Syndrome (XXY). Here, "house-keeping" genes encoded by the chromosome in question are present in different quantity
and the different amount of an amplified fragment compared to the amount in a normal chromosomal configuration can
be determined by mass spectrometry.

[0227] Further, there is growing evidence that certain DNA sequences may predispose an individual to any of a
number of diseases such as diabetes, arteriosclerosis, obesity, various autoimmune diseases and cancer (e.g.,
colorectal, breast, ovarian, lung). Also, the detection of "DNA fingerprints", e.g., polymorphisms, such as "mini- and
micro-satellite sequences", are useful for determining identity or heredity (e.g., paternity or maternity).
[0228] The following Examples 4 and 12 provide mass spectrometer based methods for identifying any of the three
different isoforms of human apolipoprotein E, which are coded by the E2, E3 and E4 alleles. For example, the molecular
weights of DNA fragments obtained after restriction with appropriate restriction endonucleases can be used to detect
the presence of a mutation and/or a specific allele.

[0229] Depending on the biological sample, the diagnosis for a genetic disease, chromosomal aneuploidy or genetic
predisposition can be performed either pre- or post-natally.

Diagnosis of cancer 

[0230] Preferred mass spectrometer-based methods for providing an early indication of the existence of a tumor or
a cancer are provided herein. For example, as described in Example 13, the telomeric repeat amplification protocol
(TRAP) in conjunction with telomerase specific extension of a substrate primer and a subsequent amplification of the
telomerase specific extension products by an amplification step using a second primer complementary to the repeat
structure was used to obtain extension ladders that were easily detected by MALDI-TOF mass spectrometry as an
indication of telomerase activity and therefore tumorigenesis.
[0231] Alternatively, as described in Example 14, expression of a tumor or cancer associated gene (e.g., human
tyrosine 5-hydroxylase) via RT-PCR and analysis of the amplified products by mass spectrometry can be used to detect
the tumor or cancer (e.g., biosynthesis of catecholamine via tyrosine 5-hydroxylase is a characteristic of
neuroblastoma).

[0232] Further, a primer oligo base extension reaction and detection of products by mass spectrometry provides a
rapid means for detecting the presence of oncogenes, such as the RET proto oncogene codon 634, which is related
to causing multiple endocrine neoplasia, type II (MEN II), as described in Example 15.

Diagnosis of infection 

[0233] Viruses, bacteria, fungi and other infectious organisms contain distinct nucleic acid sequences, which are
different from the sequences contained in the host cell. Detecting or quantitating nucleic acid sequences that are
specific to the infectious organism is important for diagnosing or monitoring infection. Examples of disease-causing
viruses that infect humans and animals and which may be detected by the disclosed processes include: Retroviridae
(e.g., human immunodeficiency viruses, such as HIV-1 (also referred to as HTLV-III, LAV or HTLV-III/LAV; see, e.g.,
Ratner et al. (1985) Nature 313:227-284; Wain-Hobson et al. (1985) Cell 40:9-17); HIV-2 (see, Guyader et al. (1987)
Nature 328:662-669; European Patent Publication No. 0 269 520; Chakrabarti et al. (1987) Nature 328:543-547; and
European Patent Application No. 0 655 501); and other isolates, such as HIV-LP (International PCT application No.
WO 94/00562 entitled "A Novel Human Immunodeficiency Virus")); Picornaviridae (e.g., polio viruses, hepatitis A virus
(see, e.g., Gust et al. (1983) Intervirology 20:1-7), enteroviruses, human coxsackieviruses, rhinoviruses, echoviruses);
Calciviridae (e.g., strains that cause gastroenteritis); Togaviridae (e.g., equine encephalitis viruses, rubella viruses);
Flaviridae (e.g., dengue viruses, encephalitis viruses, yellow fever viruses); Coronaviridae (e.g., coronaviruses);
Rhabdoviridae (e.g., vesicular stomatitis viruses, rabies viruses); Filoviridae (e.g., ebola viruses); Paramyxoviridae (e.g.,
parainfluenza viruses, mumps virus, measles virus, respiratory syncytial virus); Orthomyxoviridae (e.g., influenza
viruses); Bungaviridae (e.g., Hantaan viruses, bunga viruses, phleboviruses and Nairo viruses); Arenaviridae
(hemorrhagic fever viruses); Reoviridae (e.g., reoviruses, orbiviruses and rotaviruses); Birnaviridae; Hepadnaviridae (Hepatitis
B virus); Parvoviridae (parvoviruses); Papovaviridae (papilloma viruses, polyoma viruses); Adenoviridae (most
adenoviruses); Herpesviridae (herpes simplex virus (HSV) 1 and 2, varicella zoster virus, cytomegalovirus (CMV), herpes
viruses); Poxviridae (variola viruses, vaccinia viruses, pox viruses); and Iridoviridae (e.g., African swine fever virus);
and unclassified viruses (e.g., the etiological agents of Spongiform encephalopathies, the agent of delta hepatitis
(thought to be a defective satellite of hepatitis B virus), the agents of non-A, non-B hepatitis (class 1 = internally
transmitted; class 2 = parenterally transmitted (i.e., Hepatitis C)); Norwalk and related viruses, and astroviruses).
[0234] Examples of infectious bacteria include, but are not limited to: Helicobacter pyloris, Borelia burgdorferi,
Legionella pneumophilia, Mycobacteria sps (e.g., M. tuberculosis, M. avium, M. intracellulare, M. kansaii, M. gordonae),
Staphylococcus aureus, Neisseria gonorrhoeae, Neisseria meningitidis, Listeria monocytogenes, Streptococcus
pyogenes (Group A Streptococcus), Streptococcus agalactiae (Group B Streptococcus), Streptococcus (viridans group),
Streptococcus faecalis, Streptococcus bovis, Streptococcus (anaerobic sps.), Streptococcus pneumoniae,
pathogenic Campylobacter sp., Enterococcus sp., Haemophilus influenzae, Bacillus antracis, Corynebacterium diphtheriae,
Corynebacterium sp., Erysipelothrix rhusiopathiae, Clostridium perfringers, Clostridium tetani, Enterobacter aerogenes,
Klebsiella pneumoniae, Pasturella multocida, Bacteroides sp., Fusobacterium nucleatum, Streptobacillus moniliformis,
Treponema pallidium, Treponema pertenue, Leptospira, and Actinomyces israelii.

[0235] Examples of infectious fungi include: Cryptococcus neoformans, Histoplasma capsulatum, Coccidioides
immitis, Blastomyces dermatitidis, Chlamydia trachomatis, Candida albicans. Other infectious organisms (i.e., protists)
include: Plasmodium falciparum and Toxoplasma gondii.

[0236] The processes provided herein make use of the known sequence information of the target sequence and
known mutation sites, although new mutations can also be detected. For example, as shown in FIGURE 8, a transcript
of a nucleic acid molecule obtained from a biological sample can be specifically digested using one or more nucleases
and the fragments captured on a solid support carrying the corresponding complementary nucleic acid sequences.
Detection of hybridization and the molecular weights of the captured target sequences provide information on whether
and where in a gene a mutation is present. Alternatively, DNA can be cleaved by one or more specific endonucleases
to form a mixture of fragments. Comparison of the molecular weights between wildtype and mutant fragment mixtures
results in mutation detection.
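The comparison of wildtype and mutant fragment mixtures described above can be sketched in a few lines of Python. This is only an illustration: the function name, the matching tolerance of 2 Da and the fragment masses are all hypothetical and are not taken from the specification.

```python
def compare_fragment_masses(wildtype, mutant, tol=2.0):
    """Return (lost, gained): wildtype masses with no mutant match, and vice versa."""

    def unmatched(masses, reference):
        # A mass counts as matched if any reference mass lies within tol (Da).
        return [m for m in masses if not any(abs(m - r) <= tol for r in reference)]

    return unmatched(wildtype, mutant), unmatched(mutant, wildtype)


# Hypothetical fragment masses (Da), for illustration only.
wildtype_masses = [3050.0, 4512.3, 6120.8]
mutant_masses = [3050.5, 4512.0, 6136.8]

lost, gained = compare_fragment_masses(wildtype_masses, mutant_masses)
print("absent from mutant:", lost)
print("new in mutant:", gained)
```

A fragment that disappears from the wildtype pattern and a new fragment at a shifted mass together point to the fragment in which the sequence change lies.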

Sequencing by generation of specifically terminated fragments

[0237] In another embodiment, an accurate sequence determination of a relatively large target nucleic acid can be
obtained by generating specifically terminated fragments from the target nucleic acid, determining the mass of each
fragment by mass spectrometry and ordering the fragments to determine the sequence of the larger target nucleic
acid. In a preferred embodiment, the specifically terminated fragments are partial or complete base-specifically
terminated fragments.

[0238] One method for generating base specifically terminated fragments involves using a base-specific ribonuclease
after, e.g., a transcription reaction. Preferred base-specific ribonucleases are selected from among: T1-ribonuclease
(G-specific), U2-ribonuclease (A-specific), PhyM-ribonuclease (U specific) and ribonuclease A (U/C specific). Other
efficient and base-specific ribonucleases can be identified using the assay described in Example 21. Preferably modified
nucleotides are included in the transcription reaction with unmodified nucleotides. Most preferably, the modified
nucleotides and unmodified nucleotides are added to the transcription reaction at appropriate concentrations, so that both
moieties are incorporated at a preferential rate of about 1:1. Alternatively, two separate transcriptions of the target DNA
sequence, one with the modified and one with the unmodified nucleotides, can be performed and the results compared.
Preferred modified nucleotides include: boron or bromine modified nucleotides (Porter et al. (1995) Biochemistry 34:
11963-11969; Hasan et al. (1996) Nucl. Acids Res. 24:2150-2157; Li et al. (1995) Nucleic Acids Res. 23:4495-4501),
α-thio-modified nucleotides, as well as mass-modified nucleotides as described above.
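To make the idea of base-specific termination concrete, the following Python sketch performs a complete in-silico digest of a transcript on the 3' side of every occurrence of one base, as a G-specific ribonuclease such as T1 would. The function name and the example sequence are hypothetical; partial digestion and the effect of modified nucleotides are deliberately not modelled.

```python
def digest_after_base(rna, base="G"):
    """Split an RNA sequence after every occurrence of `base` (complete digest)."""
    fragments = []
    start = 0
    for i, nucleotide in enumerate(rna):
        if nucleotide == base:
            # The cleaved fragment ends with the recognized base.
            fragments.append(rna[start:i + 1])
            start = i + 1
    if start < len(rna):
        # Remaining 3'-terminal fragment that does not end in `base`.
        fragments.append(rna[start:])
    return fragments


# Hypothetical transcript, for illustration only.
transcript = "AUGCCGUAAGC"
print(digest_after_base(transcript, "G"))
```

Every fragment except possibly the last ends in the recognized base, which is what allows the fragments, once their masses are determined, to be ordered into a sequence.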

[0239] Another method for generating base specifically terminated fragments involves performing a combined
amplification and base-specific termination reaction. For example, a combined amplification and termination reaction can
be performed using at least two different polymerase enzymes, each having a different affinity for the chain terminating
nucleotide, so that polymerization by an enzyme with relatively low affinity for the chain terminating nucleotide leads
to exponential amplification whereas an enzyme with relatively high affinity for the chain terminating nucleotide
terminates the polymerization and yields sequencing products.

[0240] The combined amplification and sequencing can be based on any amplification procedure that employs an
enzyme with polynucleotide synthetic ability (e.g., polymerase). One preferred process, based on the polymerase chain
reaction (PCR), includes the following three thermal steps: 1) denaturing a double stranded (ds) DNA molecule at an
appropriate temperature and for an appropriate period of time to obtain the two single stranded (ss) DNA molecules
(the template: sense and antisense strand); 2) contacting the template with at least one primer that hybridizes to at
least one ss DNA template at an appropriate temperature and for an appropriate period of time to obtain a primer
containing ss DNA template; 3) contacting the primer containing template at an appropriate temperature and for an
appropriate period of time with: (i) a complete set of chain elongating nucleotides, (ii) at least one chain terminating
nucleotide, (iii) a first DNA polymerase, which has a relatively low affinity towards the chain terminating nucleotide;
and (iv) a second DNA polymerase, which has a relatively high affinity towards the chain terminating nucleotide.
[0241] Steps 1)-3) can be sequentially performed for an appropriate number of times (cycles) to obtain the desired
amount of amplified sequencing ladders. The quantity of the base specifically terminated fragment desired dictates
how many cycles are performed. Although an increased number of cycles results in an increased level of amplification,
it may also detract from the sensitivity of a subsequent detection. It is therefore generally undesirable to perform more
than about 50 cycles, and is more preferable to perform fewer than about 40 cycles (e.g., about 20-30 cycles).
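As a rough guide to how cycle number relates to the level of amplification, the textbook model of exponential amplification can be written as below. This model is an assumption brought in for illustration, not part of the specification: the function name and the per-cycle efficiency parameter are hypothetical, and a combined amplification and termination reaction would be expected to fall short of the ideal because some strands are terminated in every cycle.

```python
def fold_amplification(cycles, efficiency=1.0):
    """Idealized fold-increase in template after `cycles` rounds.

    efficiency = 1.0 means perfect doubling per cycle; lower values model
    incomplete extension. This is a textbook approximation only.
    """
    return (1.0 + efficiency) ** cycles


for n in (20, 30, 40):
    print(n, "cycles:", fold_amplification(n))
```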
[0242] Another preferred process for simultaneously amplifying and chain terminating a nucleic acid sequence is
based on strand displacement amplification (SDA) (see, e.g., Walker et al. (1994) Nucl. Acids Res. 22:2670-77;
European Patent Publication Number 0 684 315 entitled "Strand Displacement Amplification Using Thermophilic Enzymes").
In essence, this process involves the following three steps, which altogether constitute a cycle: 1) denaturing a double
stranded (ds) DNA molecule containing the sequence to be amplified at an appropriate temperature and for an
appropriate period of time to obtain the two single stranded (ss) DNA molecules (the template: sense and antisense strand);
2) contacting the template with at least one primer (P) that contains a recognition/cleavage site for a restriction
endonuclease (RE) and that hybridizes to at least one ss DNA template at an appropriate temperature and for an appropriate
period of time to obtain a primer containing ss DNA template; 3) contacting the primer containing template at an
appropriate temperature and for an appropriate period of time with (i) a complete set of chain elongating nucleotides; (ii)
at least one chain terminating nucleotide; (iii) a first DNA polymerase, which has a relatively low affinity towards the
chain terminating nucleotide; (iv) a second DNA polymerase, which has a relatively high affinity towards the chain
terminating nucleotide; and (v) an RE that nicks the primer recognition/cleavage site.

[0243] Steps 1)-3) can be sequentially performed for an appropriate number of times (cycles) to obtain the desired
amount of amplified sequencing ladders. As with the PCR based process, the quantity of the base specifically
terminated fragment desired dictates how many cycles are performed. Preferably, fewer than 50 cycles, more preferably fewer
than about 40 cycles and most preferably about 20 to 30 cycles are performed.

[0244] Preferably about 0.5 to about 3 units of polymerase are used in the combined amplification and chain termination
reaction. Most preferably about 1 to 2 units are used. Particularly preferred polymerases for use in conjunction with PCR
or other thermal amplification processes are thermostable polymerases, such as Taq DNA polymerase (Boehringer
Mannheim), AmpliTaq FS DNA polymerase (Perkin-Elmer), Deep Vent (exo-), Vent, Vent (exo-) and Deep Vent DNA
polymerases (New England Biolabs), Thermo Sequenase (Amersham) or exo(-) Pyrococcus furiosus (Pfu) DNA
polymerase (Stratagene, Heidelberg, Germany), as well as AmpliTaq, Ultman, 9 degree Nm, Tth, Hot Tub, and Pyrococcus
furiosus polymerases. In addition, preferably the polymerase does not have 5'-3' exonuclease activity.

[0245] In addition to polymerases which have a relatively high and a relatively low affinity to the chain terminating
nucleotide, a third polymerase, which has proofreading capacity (e.g., Pyrococcus woesei (Pwo) DNA polymerase),
may also be added to the amplification mixture to enhance the fidelity of amplification.

[0246] Yet another method for generating base specifically terminated fragments involves contacting an appropriate
amount of the target nucleic acid with a specific endonuclease or exonuclease. Preferably, the original 5' and/or 3' end
of the nucleic acid is tagged to facilitate the ordering of fragments. Tagging of the 3' end is particularly preferred when
in vitro nucleic acid transcripts are being analyzed, so that the influence of 3' heterogeneity, premature termination and
nonspecific elongation can be minimized. 5' and 3' tags can be natural (e.g., a 3' poly A tail or 5' or 3' heterogeneity)
or artificial. Preferred 5' and/or 3' tags are selected from among the molecules described for mass-modification above.

[0247] The methods provided herein are further illustrated by the following examples, which should not be construed
as limiting in any way.

EXAMPLE 1 

MALDI-TOF desorption of oligonucleotides directly on solid supports

[0248] 1 g CPG (Controlled Pore Glass) was functionalized with 3-(triethoxysilyl)-epoxypropane to form OH-groups
on the polymer surface. A standard oligonucleotide synthesis with 13 mg of the OH-CPG on a DNA synthesizer (Milligen,
Model 7500) employing β-cyanoethyl-phosphoamidites (Koster et al. (1994) Nucleic Acids Res. 12:4539) and TAC
N-protecting groups (Koster et al. (1981) Tetrahedron 37:362) was performed to synthesize a 3'-T5-50mer oligonucleotide
sequence in which 50 nucleotides are complementary to a "hypothetical" 50mer sequence. T5 serves as a spacer.
Deprotection with saturated ammonia in methanol at room temperature for 2 hours furnished, according to the
determination of the DMT group, CPG which contained about 10 µmol 55mer/g CPG. This 55mer served as a template for
hybridizations with a 26-mer (with 5'-DMT group) and a 40-mer (without DMT group). The reaction volume was 100 µl
and contained about 1 nmol CPG bound 55mer as template and an equimolar amount of oligonucleotide in solution (26-mer
or 40-mer) in 20 mM Tris-HCl, pH 7.5, 10 mM MgCl2 and 25 mM NaCl. The mixture was heated for 10 min at 65°C
and cooled to 37°C over 30 min (annealing). Oligonucleotide that had not hybridized to the polymer-bound
template was removed by centrifugation and three subsequent washing/centrifugation steps with 100 µl each of ice-cold
50 mM ammonium citrate. The beads were air-dried, mixed with matrix solution (3-hydroxypicolinic acid/10 mM
ammonium citrate in acetonitrile/water, 1:1), and analyzed by MALDI-TOF mass spectrometry. The results are
presented in Figures 10 and 11.

EXAMPLE 2 

Electrospray (ES) desorption and differentiation of an 18-mer and 19-mer

[0249] DNA fragments at a concentration of 50 pmole/µl in 2-propanol/10 mM ammonium carbonate (1/9, v/v) were
analyzed simultaneously by an electrospray mass spectrometer.

[0250] The successful desorption and differentiation of an 18-mer and 19-mer by electrospray mass spectrometry
is shown in FIGURE 12.

EXAMPLE 3

Detection of The Cystic Fibrosis Mutation ΔF508, by single step dideoxy extension and analysis by MALDI-TOF
mass spectrometry (Competitive Oligonucleotide Single Base Extension = COSBE)

[0251] The principle of the COSBE method is shown in FIGURE 13, N being the normal and M the mutation detection
primer, respectively.

MATERIALS AND METHODS 


[0252] PCR Amplification and Strand Immobilization. Amplification was carried out with exon 10 specific primers
using standard PCR conditions (30 cycles: 1' @ 95°C, 1' @ 55°C, 2' @ 72°C); the reverse primer was 5' labelled with
biotin and column purified (Oligopurification Cartridge, Cruachem). After amplification the amplified products were
purified by column separation (Qiagen Quickspin) and immobilized on streptavidin coated magnetic beads (Dynabeads,
Dynal, Norway) according to their standard protocol; DNA was denatured using 0.1 M NaOH and washed with 0.1 M
NaOH, 1x B+W buffer and TE buffer to remove the non-biotinylated sense strand.

[0253] COSBE Conditions. The beads containing ligated antisense strand were resuspended in 18 µL of Reaction
mix 1 (2 µL 10X Taq buffer, 1 µL (1 unit) Taq Polymerase, 2 µL of 2 mM dGTP, and 13 µL H2O) and incubated at 80°C
for 5' before the addition of Reaction mix 2 (100 ng each of COSBE primers). The temperature was reduced to 60°C
and the mixtures incubated for a 5' annealing/extension period; the beads were then washed in 25 mM triethylammonium
acetate (TEAA) followed by 50 mM ammonium citrate.

[0254] Primer Sequences. All primers were synthesized on a Perseptive Biosystems Expedite 8900 DNA Synthesizer
using conventional phosphoramidite chemistry (Sinha et al. (1984) Nucleic Acids Res. 12:4539). COSBE primers (each
containing an intentional mismatch one base before the 3'-terminus) were those used in a previous ARMS study (Ferrie
et al. (1992) Am J Hum Genet 51:251-262) with the exception that two bases were removed from the 5'-end of the
normal:

Ex10 PCR (Forward): 5'-BIO-GCA AGT GAA TCC TGA GCG TG-3' (SEQ ID No. 1)
Ex10 PCR (Reverse): 5'-GTG TGA AGG GTT CAT ATG C-3' (SEQ ID No. 2)
COSBE ΔF508-N 5'-ATC TAT ATT CAT CAT AGG AAA CAC CAC A-3' (28-mer) (SEQ ID No. 3)
COSBE ΔF508-M 5'-GTA TCT ATA TTC ATC ATA GGA AAC ACC ATT-3' (30-mer) (SEQ ID No. 4)

[0255] Mass Spectrometry. After washing, beads were resuspended in 1 µL 18 Mohm/cm H2O. 300 nL each of matrix
(Wu et al. (1993) Rapid Commun. Mass Spectrom. 7:142-146) solution (0.7 M 3-hydroxypicolinic acid, 0.7 M dibasic
ammonium citrate in 1:1 H2O:CH3CN) and resuspended beads (Tang et al. (1995) Rapid Commun Mass Spectrom 8:
727-730) were mixed on a sample target and allowed to air dry. Up to 20 samples were spotted on a probe target disk
for introduction into the source region of an unmodified Thermo Bioanalysis (formerly Finnigan) Vision 2000 MALDI-TOF
operated in reflectron mode with 5 and 20 kV on the target and conversion dynode, respectively. Theoretical
average molecular weights (Mr(calc)) were calculated from atomic compositions. Vendor provided software was used
to determine peak centroids using external calibration; 1.08 Da has been subtracted from these to correct for the charge
carrying proton mass to yield the text Mr(exp) values.

[0256] Scheme. Upon annealing to the bound template, the N and M primers (8508.6 and 9148.0 Da, respectively)
are presented with dGTP; only primers with proper Watson-Crick base pairing at the variable (V) position are extended
by the polymerase. Thus if V pairs with the 3'-terminal base of N, N is extended to a 8837.9 Da product (N+1). Likewise,
if V is properly matched to the M terminus, M is extended to a 9477.3 Da M+1 product.
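The mass arithmetic of this scheme lends itself to a short worked example. The Python sketch below uses the primer and product masses and the 1.08 Da proton correction quoted in the text; everything else is an assumption made for illustration, namely the function name, the 5 Da matching tolerance, the genotype labels and the rule that extension of both primers indicates a heterozygote.

```python
# Masses in Da, as quoted in the text.
PRIMER_MASS = {"N": 8508.6, "M": 9148.0}
EXTENDED_MASS = {"N": 8837.9, "M": 9477.3}
PROTON_CORRECTION = 1.08  # subtracted from peak centroids, per the text


def call_genotype(peak_centroids, tol=5.0):
    """Classify a spectrum by which primers were extended (tol is hypothetical)."""
    masses = [c - PROTON_CORRECTION for c in peak_centroids]
    extended = {
        name
        for name, target in EXTENDED_MASS.items()
        if any(abs(m - target) <= tol for m in masses)
    }
    if extended == {"N"}:
        return "homozygous normal"
    if extended == {"M"}:
        return "homozygous mutant"
    if extended == {"N", "M"}:
        return "heterozygous"
    return "no call"


# Both primers gain the same mass on extension by a single dG residue.
for name in ("N", "M"):
    print(name, round(EXTENDED_MASS[name] - PRIMER_MASS[name], 1))

# Hypothetical centroids: extended N primer plus unextended M primer.
print(call_genotype([8838.98, 9149.08]))
```

Because both primers compete for the same template and differ in length by two bases, the extended and unextended forms of N and M fall at four well-separated masses, so a single spectrum suffices for the call.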

Results 

[0257] Figures 14-18 show the representative mass spectra of COSBE reaction products. Better results were
obtained when amplified products were purified before the biotinylated anti-sense strand was bound.

EXAMPLE 4 

Differentiation of Human Apolipoprotein E Isoforms by Mass Spectrometry

[0258] Apolipoprotein E (Apo E), a protein component of lipoproteins, plays an essential role in lipid metabolism. For
example, it is involved with cholesterol transport, metabolism of lipoprotein particles, immunoregulation and activation
of a number of lipolytic enzymes.

[0259] There are three common isoforms of human Apo E (coded by E2, E3 and E4 alleles). The most common is
the E3 allele. The E2 allele has been shown to decrease the cholesterol level in plasma and therefore may have a
protective effect against the development of atherosclerosis. The DNA encoding a portion of the E2 allele is set forth
in SEQ ID No. 130. Finally, the E4 isoform has been correlated with increased levels of cholesterol, conferring
predisposition to atherosclerosis. Therefore, the identity of the apo E allele of a particular individual is an important determinant
of risk for the development of cardiovascular disease.

[0260] As shown in Figure 19, a sample of DNA encoding apolipoprotein E can be obtained from a subject and amplified
(e.g., via PCR), and the amplified product can be digested using an appropriate enzyme (e.g., CfoI). The restriction
digest obtained can then be analyzed by a variety of means. As shown in Figure 20, the three isotypes of apolipoprotein
E (E2, E3 and E4) have different nucleic acid sequences and therefore also have distinguishable molecular weight
values.

[0261] As shown in Figure 21 A-C, different Apolipoprotein E genotypes exhibit different restriction patterns in a 3.5%
MetaPhor Agarose Gel or 12% polyacrylamide gel. As shown in Figures 22 and 23, the various apolipoprotein E
genotypes can also be accurately and rapidly determined by mass spectrometry.

EXAMPLE 5 

Detection of hepatitis B virus in serum samples. 
MATERIALS AND METHODS

Sample preparation 

[0262] Phenol/chloroform extraction of viral DNA and the final ethanol precipitation were done according to standard
protocols.

First PCR 

[0263] Each reaction was performed with 5 µl of the DNA preparation from serum. 15 pmol of each primer and 2 units
Taq DNA polymerase (Perkin Elmer, Weiterstadt, Germany) were used. The final concentration of each dNTP was
200 µM, the final volume of the reaction was 50 µl. 10x PCR buffer (Perkin Elmer, Weiterstadt, Germany) contained
100 mM Tris-HCl, pH 8.3, 500 mM KCl, 15 mM MgCl2, 0.01% gelatine (w/v). Primer sequences:

Primer    SEQUENCE                                   SEQ ID No.
1         5'-GCTTTGGGGCATGGACATTGACCCGTATAA-3'       5
2         5'-CTGACTACTAATTCCCTGGATGCTGGGTCT-3'       6

Nested PCR: 

[0264] Each reaction was performed either with 1 µl of the first reaction or with a 1:10 dilution of the first PCR as
template, respectively. 100 pmol of each primer, 2.5 u Pfu(exo-) DNA polymerase (Stratagene, Heidelberg, Germany),
a final concentration of 200 µM of each dNTP and 5 µl 10x Pfu buffer (200 mM Tris-HCl, pH 8.75, 100 mM KCl, 100
mM (NH4)2SO4, 1% Triton X-100, 1 mg/ml BSA) (Stratagene, Heidelberg, Germany) were used in a final volume of 50
µl. The reactions were performed in a thermocycler (OmniGene, MWG-Biotech, Ebersberg, Germany) using the
following program: 92°C for 1 minute, 60°C for 1 minute and 72°C for 1 minute with 20 cycles. Sequence of
oligodeoxynucleotides (purchased HPLC-purified from MWG-Biotech, Ebersberg, Germany):

HBV13: 5'-TTGCCTGAGTGCAGTATGGT-3' (SEQ ID NO. 7)

HBV15bio: Biotin-5'-AGCTCTATATCGGGAAGCCT-3' (SEQ ID NO. 8)

Purification of amplified products:

[0265] For the recording of each spectrum, one PCR, 50 µl (performed as described above), was used. Purification
was done according to the following procedure: Ultrafiltration was done using Ultrafree-MC filtration units (Millipore,
Eschborn, Germany) according to the protocol of the provider with centrifugation at 8000 rpm for 20 minutes. 25 µl
(10 µg/µl) streptavidin Dynabeads (Dynal, Hamburg, Germany) were prepared according to the instructions of the
manufacturer and resuspended in 25 µl of B/W buffer (10 mM Tris-HCl, pH 7.5, 1 mM EDTA, 2 M NaCl). This suspension
was added to the PCR samples still in the filtration unit and the mixture was incubated with gentle shaking for 15
minutes at ambient temperature. The suspension was transferred into a 1.5 ml Eppendorf tube and the supernatant was
removed with the aid of a Magnetic Particle Collector, MPC (Dynal, Hamburg, Germany). The beads were washed
twice with 50 µl of 0.7 M ammonium citrate solution, pH 8.0 (the supernatant was removed each time using the MPC).
Cleavage from the beads can be accomplished by using formamide at 90°C. The supernatant was dried in a speedvac
for about an hour and resuspended in 4 µl of ultrapure water (MilliQ UF plus, Millipore, Eschborn, Germany). This
preparation was used for MALDI-TOF MS analysis.

MALDI-TOF MS: 

[0266] Half a microliter of the sample was pipetted onto the sample holder, then immediately mixed with 0.5 µl matrix
solution (0.7 M 3-hydroxypicolinic acid, 50% acetonitrile, 70 mM ammonium citrate). This mixture was dried at ambient
temperature and introduced into the mass spectrometer. All spectra were taken in positive ion mode using a Finnigan
MAT Vision 2000 (Finnigan MAT, Bremen, Germany), equipped with a reflectron (5 keV ion source, 20 keV
postacceleration) and a 337 nm nitrogen laser. Calibration was done with a mixture of a 40-mer and a 100-mer. Each sample
was measured with different laser energies. In the negative samples, the amplified product was detected neither with
lower nor with higher laser energies. In the positive samples the amplified product was detected at different places of
the sample spot and also with varying laser energies.

RESULTS 

[0267] A nested PCR system was used for the detection of HBV DNA in blood samples employing oligonucleotides
complementary to the c region of the HBV genome (primer 1: beginning at map position 1763, primer 2 beginning at
map position 2032 of the complementary strand) encoding the HBV core antigen (HBVcAg). DNA was isolated from
patients' serum according to standard protocols. A first PCR was performed with the DNA from these preparations
using a first set of primers. If HBV DNA was present in the sample a DNA fragment of 269 bp was generated.
[0268] In the second reaction, primers which were complementary to a region within the PCR fragment generated
in the first PCR were used. If HBV related amplified products were present in the first PCR, a DNA fragment of 67 bp
was generated in this nested PCR (see Fig. 25A). The usage of a nested PCR system for detection provides a high
sensitivity and also serves as a specificity control for the external PCR (Rolfs et al. (1992) PCR: Clinical Diagnostics
and Research, Springer, Heidelberg). A further advantage is that the amount of fragments generated in the second
PCR is high enough to ensure an unproblematic detection although purification losses cannot be avoided.

[0269] The samples were purified using ultrafiltration to remove the primers prior to immobilization on streptavidin
Dynabeads. This purification was done because the shorter primer fragments were immobilized in higher yield on the
beads due to steric reasons. The immobilization was done directly on the ultrafiltration membrane to avoid substance
losses due to unspecific absorption on the membrane. Following immobilization, the beads were washed with ammonium
citrate to perform cation exchange (Pieles et al. (1993) Nucl. Acids Res. 21:3191-3196). The immobilized DNA was
cleaved from the beads using 25% ammonia, which allows cleavage of DNA from the beads in a very short time, but
does not result in an introduction of sodium or other cations.

[0270] The nested PCRs and the MALDI-TOF analysis were performed without knowing the results of serological
analysis. Due to the unknown virus titer, each sample of the first PCR was used undiluted as template and in a 1:10
dilution, respectively.

[0271] Sample 1 was collected from a patient with chronic active HBV infection who was positive in Hbs- and Hbe-antigen
tests but negative in a dot blot analysis. Sample 2 was a serum sample from a patient with an active HBV
infection and a massive viremia who was HBV positive in a dot blot analysis. Sample 3 was a denatured serum sample;
therefore no serological analysis could be performed, but an increased level of transaminases indicating liver disease
was detected. In autoradiograph analysis (Figure 24), the first PCR of this sample was negative. Nevertheless, there
was some evidence of HBV infection. This sample is of interest for MALDI-TOF analysis, because it demonstrates that
even low-level amounts of amplified products can be detected after the purification procedure. Sample 4 was from a
patient who was cured of HBV infection. Samples 5 and 6 were collected from patients with a chronic active HBV
infection.

[0272] Figure 24 shows the results of a PAGE analysis of the nested PCR reaction. An amplified product is clearly
revealed in samples 1, 2, 3, 5 and 6. In sample 4 no amplified product was generated; it is indeed HBV negative,
according to the serological analysis. Negative and positive controls are indicated by - and +, respectively. Amplification
artifacts are visible in lanes 2, 5, 6 and + if non-diluted template was used. These artifacts were not generated if the
template was used in a 1:10 dilution. In sample 3, amplified product was only detectable if the template was not
diluted. The results of PAGE analysis are in agreement with the data obtained by serological analysis except for sample
3 as discussed above.

[0273] Figure 25A shows a mass spectrum of a nested amplified product from sample number 1 generated and
purified as described above. The signal at 20754 Da represents the single stranded amplified product (calculated:
20735 Da, as the average mass of both strands of the amplified product cleaved from the beads). The mass difference
of calculated and obtained mass is 19 Da (0.09%). As shown in Fig. 25A, sample number 1 generated a high amount
of amplified product, resulting in an unambiguous detection.

[0274] Fig. 25B shows a spectrum obtained from sample number 3. As depicted in Fig. 24, the amount of amplified
product generated in this section is significantly lower than that from sample number 1. Nevertheless, the amplified
product is clearly revealed with a mass of 20751 Da (calculated 20735). The mass difference is 16 Da (0.08%). The
spectrum depicted in Fig. 25C was obtained from sample number 4 which is HBV negative (as is also shown in Fig.
24). As expected no signals corresponding to the amplified product could be detected. All samples shown in Fig. 25
were analyzed with MALDI-TOF MS, whereby amplified product was detected in all HBV positive samples, but not in
the HBV negative samples. These results were reproduced in several independent experiments.
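The mass deviations quoted for samples 1 and 3 follow directly from the observed and calculated masses. A minimal Python check, using only the figures given in the text (the function name is hypothetical), might look like this:

```python
def mass_error(observed, calculated):
    """Return (absolute difference in Da, relative difference in percent)."""
    difference = observed - calculated
    return difference, 100.0 * difference / calculated


# Observed and calculated masses (Da) as quoted for samples 1 and 3.
for label, observed in (("sample 1", 20754), ("sample 3", 20751)):
    difference, percent = mass_error(observed, 20735)
    print(label, difference, "Da", round(percent, 2), "%")
```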


EXAMPLE 6 

Analysis of Ligase Chain Reaction Products Via MALDI-TOF Mass Spectrometry 
MATERIALS AND METHODS

Oligodeoxynucleotides

[0275] Except for the biotinylated one, all oligonucleotides were synthesized on a 0.2 µmol scale on a MilliGen
7500 DNA Synthesizer (Millipore, Bedford, MA, USA) using the β-cyanoethylphosphoamidite method (Sinha, N.D. et
al. (1984) Nucleic Acids Res. 12:4539-4577). The oligodeoxynucleotides were RP-HPLC-purified and deprotected
according to standard protocols. The biotinylated oligodeoxynucleotide was purchased (HPLC-purified) from Biometra,
Göttingen, Germany. Sequences and calculated masses of the oligonucleotides used:

Oligodeoxynucleotide    SEQUENCE                                          SEQ ID No.
A                       5'-p-TTGTGCCACGCGGTTGGGAATGTA (7521 Da)           9
B                       5'-p-AGCAACGACTGTTTGCCCGCCAGTTG (7948 Da)         10
C                       5'-bio-TACATTCCCAACCGCGTGGCACAAC (7960 Da)        11
D                       5'-p-AACTGGCGGGCAAACAGTCGTTGCT (7708 Da)          12

5'-Phosphorylation of oligonucleotides A and D

[0276] This was performed with polynucleotide kinase (Boehringer, Mannheim, Germany) according to published
procedures; the 5'-phosphorylated oligonucleotides were used unpurified for LCR.

Ligase chain reaction

[0277] The LCR was performed with Pfu DNA ligase and a ligase chain reaction kit (Stratagene, Heidelberg,
Germany) containing two different pBluescript KII phagemids, one carrying the wildtype form of the E. coli lacI gene and
the other one a mutant of this gene with a single point mutation at bp 191 of the lacI gene.

[0278] The following LCR conditions were used for each reaction: 100 pg template DNA (0.74 fmol) with 500 pg
sonicated salmon sperm DNA as carrier, 25 ng (3.3 pmol) of each 5'-phosphorylated oligonucleotide, 20 ng (2.5 pmol)
of each non-phosphorylated oligonucleotide, 4 U Pfu DNA ligase in a final volume of 20 µl buffered [...] ss 50-mer was
used (1 fmol) as template; in this case oligo C was also biotinylated. All reactions were performed in a thermocycler
(OmniGene, MWG-Biotech, Ebersberg, Germany) with the following program: 4 minutes at 92°C, 2 minutes at 60°C and
25 cycles of 20 seconds at 92°C and 40 seconds at 60°C. Except for HPLC analysis, the biotinylated ligation educt C
was used. In a control experiment the biotinylated and non-biotinylated oligonucleotides gave the same gel
electrophoretic results. The reactions were analyzed on 7.5% polyacrylamide gels. Ligation product 1 (oligo A and B)
calculated mass: 15450 Da; ligation product 2 (oligo C and D) calculated mass: 15387 Da.
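
The oligonucleotide amounts quoted above (25 ng as 3.3 pmol, 20 ng as 2.5 pmol) can be cross-checked against the calculated masses from the oligonucleotide table. The following is a minimal sketch, not part of the original protocol; it assumes the listed masses apply to the oligonucleotides as used, and the function and variable names are illustrative only.

```python
# Calculated masses (Da) of the four LCR oligonucleotides, as listed in the text.
OLIGO_MASS_DA = {"A": 7521, "B": 7948, "C": 7960, "D": 7708}


def ng_to_pmol(mass_ng: float, molar_mass_da: float) -> float:
    """Convert a mass in nanograms to an amount in picomoles.

    1 Da corresponds to 1 g/mol, so ng / (g/mol) gives nmol;
    multiplying by 1000 gives pmol.
    """
    return mass_ng / molar_mass_da * 1000.0


# Per the text, A and D are the 5'-phosphorylated oligos (25 ng each);
# B and C are the non-phosphorylated ones (20 ng each).
amounts_pmol = {
    name: round(ng_to_pmol(25 if name in ("A", "D") else 20, mass), 1)
    for name, mass in OLIGO_MASS_DA.items()
}

for name, pmol in amounts_pmol.items():
    print(f"oligo {name}: {pmol} pmol")
```

Under these assumptions each result rounds to within 0.1 pmol of the figures given in the text.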

SMART-HPLC 

[0279] Ion exchange HPLC (IE HPLC) was performed on the SMART system (Pharmacia, Freiburg, Germany) using
a Pharmacia Mono Q PC 1.6/5 column. Eluents were buffer A (25 mM Tris-HCl, 1 mM EDTA and 0.3 M NaCl at pH
8.0) and buffer B (same as A, but 1 M NaCl). Starting with 100% A for 5 minutes at a flow rate of 50 µl/min, a gradient
was applied from 0 to 70% B in 30 minutes, then increased to 100% B in 2 minutes and held at 100% B for 5 minutes.
Two pooled LCR volumes (40 µl) performed with either wildtype or mutant template were injected.
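
The gradient program described above can be written as a piecewise-linear function of time. The sketch below is illustrative only: it assumes that time zero is the start of the 5-minute hold at 100% A, that each ramp is linear, and that the salt concentration of the mixed eluent is the volume-weighted mean of buffers A and B. The function names are hypothetical.

```python
def percent_b(t_min: float) -> float:
    """Percentage of buffer B at time t (minutes) in the 42-minute program."""
    if t_min < 0 or t_min > 42:
        raise ValueError("time outside the 42-minute program")
    if t_min <= 5:       # isocratic hold at 100% A
        return 0.0
    if t_min <= 35:      # 0 -> 70% B over 30 minutes
        return 70.0 * (t_min - 5) / 30
    if t_min <= 37:      # 70 -> 100% B over 2 minutes
        return 70.0 + 30.0 * (t_min - 35) / 2
    return 100.0         # hold at 100% B for 5 minutes


def nacl_molar(t_min: float) -> float:
    """NaCl concentration (M) of the mixed eluent: A is 0.3 M, B is 1 M."""
    frac_b = percent_b(t_min) / 100
    return 0.3 * (1 - frac_b) + 1.0 * frac_b


for t in (0, 5, 20, 35, 36, 42):
    print(f"t = {t:>2} min: {percent_b(t):5.1f}% B, {nacl_molar(t):.3f} M NaCl")
```

Read this way, the NaCl concentration rises from 0.3 M to 0.79 M over the main 30-minute ramp before the final step to 1 M.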

Sample preparation for MALDI-TOF-MS

[0280] Preparation of immobilized DNA: For the recording of each spectrum two LCRs (performed as described
above) were pooled and diluted 1:1 with 2x B/W buffer (10 mM Tris-HCl, pH 7.5, 1 mM EDTA, 2 M NaCl). To the
samples 5 µl streptavidin DynaBeads (Dynal, Hamburg, Germany) were added, and the mixture was allowed to bind with
gentle shaking for 15 minutes at ambient temperature. The supernatant was removed using a Magnetic Particle
Collector, MPC (Dynal, Hamburg, Germany), and the beads were washed twice with 50 µl of 0.7 M ammonium citrate
solution (pH 8.0) (the supernatant was removed each time using the MPC). The beads were resuspended in 1 µl of
ultrapure water (MilliQ, Millipore, Bedford, MA).

[0281] Combination of ultrafiltration and streptavidin DynaBeads: For the recording of a spectrum two LCRs (performed
as described above) were pooled, diluted 1:1 with 2x B/W buffer and concentrated with a 5000 NMWL Ultrafree-MC
filter unit (Millipore, Eschborn, Germany) according to the instructions of the manufacturer. After concentration the
samples were washed with 300 µl of 1x B/W buffer, and streptavidin DynaBeads were added. The beads were washed once
on the Ultrafree-MC filtration unit with 300 µl of 1x B/W buffer and processed as described above. The beads were
resuspended in 30 to 50 µl of 1x B/W buffer and transferred into a 1.5 ml Eppendorf tube. The supernatant was removed
and the beads were washed twice with 50 µl of 0.7 M ammonium citrate (pH 8.0). Finally, the beads were washed once
with 30 µl of acetone and resuspended in 1 µl of ultrapure water. The ligation mixture after immobilization on the beads
was used for MALDI-TOF-MS analysis as described below.

MALDI-TOF-MS

[0282] A suspension of streptavidin-coated magnetic beads with the immobilized DNA was pipetted onto the sample
holder, then immediately mixed with 0.5 µl matrix solution (0.7 M 3-hydroxypicolinic acid in 50% acetonitrile, 70 mM
ammonium citrate). This mixture was dried at ambient temperature and introduced into the mass spectrometer. All
spectra were taken in positive ion mode using a Finnigan MAT Vision 2000 (Finnigan MAT, Bremen, Germany),
equipped with a reflectron (5 keV ion source, 20 keV postacceleration) and a nitrogen laser (337 nm). For the analysis
of Pfu DNA ligase 0.5 µl of the solution was mixed on the sample holder with 1 µl of matrix solution and prepared as
described above. For the analysis of unpurified LCRs 1 µl of an LCR was mixed with 1 µl matrix solution.

RESULTS 

[0283] The E. coli lacI gene served as a simple model system to investigate the suitability of MALDI-TOF-MS as
a detection method for products generated in ligase chain reactions. This template system consists of an E. coli lacI
wildtype gene in a pBluescript KII phagemid and an E. coli lacI gene carrying a single point mutation at bp 191 (C to
T transition; SEQ ID No. 131) in the same phagemid. Four different oligonucleotides were used, which were ligated
only if the E. coli lacI wildtype gene was present (Figure 26).

[0284] LCR conditions were optimized using Pfu DNA ligase to obtain at least 1 pmol ligation product in each positive
reaction. The ligation reactions were analyzed by polyacrylamide gel electrophoresis (PAGE) and HPLC on the SMART
system (Figures 27, 28 and 29). Figure 27 shows a PAGE of a positive LCR with wildtype template (lane 1), a negative
LCR with mutant template (lane 2) and a negative control which contains enzyme, oligonucleotides and salmon sperm
DNA but no template (lane 3). The gel electrophoresis clearly shows that the ligation product (50 bp) was produced only
in the reaction with wildtype template, whereas neither the template carrying the point mutation nor the control reaction
with salmon sperm DNA generated amplification products. In Figure 28, HPLC was used to analyze two pooled LCRs
with wildtype template performed under the same conditions. The ligation product was clearly revealed. Figure 29
shows the results of an HPLC run in which two pooled negative LCRs with mutant template were analyzed. These
chromatograms confirm the data shown in Figure 27, and the results taken together clearly demonstrate that the system
generates ligation products in a significant amount only if the wildtype template is provided.

[0285] Appropriate control runs were performed to determine retention times of the different compounds involved in
the LCR experiments. These include the four oligonucleotides (A, B, C, and D), a synthetic ds 50-mer (with the same
sequence as the ligation product), the wildtype template DNA, sonicated salmon sperm DNA and the Pfu DNA ligase
in ligation buffer.

[0286] In order to test which purification procedure should be used before an LCR reaction can be analyzed by
MALDI-TOF-MS, aliquots of an unpurified LCR (Figure 30A) and aliquots of the enzyme stock solution (Figure 30B) were
analyzed with MALDI-TOF-MS. It turned out that appropriate sample preparation is absolutely necessary since all
signals in the unpurified LCR correspond to signals obtained in the MALDI-TOF-MS analysis of the Pfu DNA ligase.
The calculated mass values of oligo A and the ligation product are 7521 Da and 15450 Da, respectively. The data in
Figure 30 show that the enzyme solution leads to mass signals which interfere with the expected signals of the
ligation educts and products and therefore make an unambiguous signal assignment impossible. Furthermore, the
spectra showed signals of the detergent Tween20, which is part of the enzyme storage buffer and influences the
crystallization behavior of the analyte/matrix mixture in an unfavorable way.

[0287] In one purification format streptavidin-coated magnetic beads were used. As was shown in a recent paper,
the direct desorption of DNA immobilized by Watson-Crick base pairing to a complementary DNA fragment covalently
bound to the beads is possible and the non-biotinylated strand will be desorbed exclusively (Tang et al. (1995) Nucleic
Acids Res. 23:3126-3131). This approach of using immobilized ds DNA ensures that only the non-biotinylated strand
will be desorbed. If non-immobilized ds DNA is analyzed, both strands are desorbed (Tang et al. (1994) Rapid Comm.
Mass Spectrom. 7:183-186), leading to broad signals depending on the mass difference of the two single strands.
Therefore, employing this system for LCR, only the non-ligated oligonucleotide A, with a calculated mass of 7521 Da,
and the ligation product from oligo A and oligo B (calculated mass: 15450 Da) will be desorbed if oligo C is biotinylated
at the 5'-end and immobilized on streptavidin-coated beads. This results in a simple and unambiguous identification of
the LCR educts and products.

[0288] Figure 31A shows a MALDI-TOF mass spectrum obtained from two pooled LCRs (performed as described
above) purified on streptavidin DynaBeads and desorbed directly from the beads; it shows that the purification method
used was efficient (compare Figure 30). A signal which represents the unligated oligo A and a signal which
corresponds to the ligation product could be detected. The agreement between the calculated and the experimentally
found mass values is remarkable and allows an unambiguous peak assignment and accurate detection of the ligation
product. In contrast, no ligation product but only oligo A could be detected in the spectrum obtained from two pooled
LCRs with mutated template (Figure 31B). The specificity and selectivity of the LCR conditions and the sensitivity of
the MALDI-TOF detection are further demonstrated when performing the ligation reaction in the absence of a specific
template. Figure 32 shows a spectrum obtained from two pooled LCRs in which only salmon sperm DNA was used as
a negative control; only oligo A could be detected, as expected.

[0289] While the results shown in Figure 31A can be correlated to lane 1 of the gel in Figure 27, the spectrum shown
in Figure 31B is equivalent to lane 2 in Figure 27, and finally also the spectrum in Figure 32 corresponds to lane 3 in
Figure 27. The results are in congruence with the HPLC analysis presented in Figures 28 and 29. While gel
electrophoresis (Figure 27) and HPLC (Figures 28 and 29) reveal either an excess or almost equal amounts of ligation
product over ligation educts, the analysis by MALDI-TOF mass spectrometry produces a smaller signal for the ligation
product (Figure 31A).

[0290] The lower intensity of the ligation product signal could be due to different desorption/ionization efficiencies
between a 24-mer and a 50-mer. Since the Tm value of a duplex with 50 base pairs is significantly higher than that of
one with 24 base pairs, more 24-mer could be desorbed. A reduction in signal intensity can also result from a higher
degree of fragmentation in the case of the longer oligonucleotides.

[0291] Regardless of the purification with streptavidin DynaBeads, Figure 32 reveals traces of Tween20 in the region
around 2000 Da. Substances with a viscous consistency negatively influence the process of crystallization and
therefore can be detrimental to mass spectrometer analysis. Tween20 and also glycerol, which are part of enzyme storage
buffers, therefore should be removed entirely prior to mass spectrometer analysis. For this reason an improved
purification procedure which includes an additional ultrafiltration step prior to treatment with DynaBeads was
investigated. Indeed, this sample purification resulted in a significant improvement of MALDI-TOF mass spectrometric
performance.

[0292] Figure 33 shows spectra obtained from two pooled positive (Fig. 33A) and negative (Fig. 33B) LCRs,
respectively. The positive reaction was performed with a chemically synthesized, single-stranded 50-mer as template
with a sequence equivalent to the ligation product of oligo C and D. Oligo C was 5'-biotinylated; therefore the template
was not detected. As expected, only the ligation product of oligo A and B (calculated mass 15450 Da) could be desorbed
from the immobilized and ligated oligo C and D. This newly generated DNA fragment is represented by the mass signal
of 15448 Da in Figure 33A. Compared to Figure 32A, this spectrum clearly shows that this method of sample preparation
produces signals with improved resolution and intensity.

EXAMPLE 7 

Mutation detection by solid phase oligo base extension of a primer and analysis by MALDI-TOF mass
spectrometry (Primer Oligo Base Extension = Probe)

Summary 

[0293] The solid-phase oligo base extension method detects point mutations and small deletions as well as small
insertions in amplified DNA. The method is based on the extension of a detection primer that anneals adjacent to a
variable nucleotide position on an affinity-captured amplified template, using a DNA polymerase, a mixture of three
dNTPs, and the missing fourth nucleotide as a dideoxynucleotide. The resulting products are evaluated and resolved by
MALDI-TOF mass spectrometry without further labeling procedures. The aim of the following experiment was to
determine mutant and wildtype alleles in a fast and reliable manner.

Description of the experiment 

[0294] The method used a single detection primer followed by an oligonucleotide extension step to give products
differing in length by some bases specific for mutant or wildtype alleles, which can be easily resolved by MALDI-TOF
mass spectrometry. The method is described using exon 10 of the CFTR gene as an example. Exon 10 of this
gene bears the most common mutation in many ethnic groups (ΔF508), which in the homozygous state leads to the
clinical phenotype of cystic fibrosis.

MATERIALS AND METHODS

Genomic DNA

[0295] Genomic DNA was obtained from healthy individuals, individuals homozygous or heterozygous for the ΔF508
mutation, and one individual heterozygous for the I506S mutation. The wildtype and mutant alleles were confirmed
by standard Sanger sequencing.

PCR amplification of exon 10 of the CFTR gene

[0296] The primers for PCR amplification were CFEx10-F (5'-GCAAGTGAATCCTGAGCGTG-3' (SEQ ID No. 13),
located in intron 9 and biotinylated) and CFEx10-R (5'-GTGTGAAGGGCGTG-3' (SEQ ID No. 14), located in intron 10).
Primers were used in a concentration of 8 pmol. Taq polymerase including 10x buffer was purchased from
Boehringer-Mannheim and dNTPs were obtained from Pharmacia. The total reaction volume was 50 µl. Cycling conditions
for PCR were initially 5 min at 95°C, followed by 1 min at 94°C, 45 sec at 53°C, and 30 sec at 72°C for 40 cycles with
a final extension time of 5 min at 72°C.
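
As a rough planning aid, the programmed hold time of the PCR described above can be added up as follows. This is a sketch only: it counts the stated hold times and ignores the temperature ramping of the thermocycler, so the real run time will be longer. The constant and function names are illustrative.

```python
# (label, seconds) for each step of one PCR cycle, as given in the text.
CYCLE_STEPS = [("denature, 94 C", 60), ("anneal, 53 C", 45), ("extend, 72 C", 30)]
INITIAL_DENATURATION_S = 5 * 60   # 5 min at 95 C
FINAL_EXTENSION_S = 5 * 60        # 5 min at 72 C
N_CYCLES = 40


def program_seconds() -> int:
    """Total programmed hold time in seconds, excluding ramp times."""
    per_cycle = sum(seconds for _, seconds in CYCLE_STEPS)
    return INITIAL_DENATURATION_S + N_CYCLES * per_cycle + FINAL_EXTENSION_S


total_s = program_seconds()
print(f"{total_s} s = {total_s / 60:.0f} min of programmed hold time")
```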

Purification of the amplified products

[0297] Amplification products were purified by using Qiagen's PCR purification kit (No. 28106) according to the
manufacturer's instructions. The elution of the purified products from the column was done in 50 µl TE-buffer (10 mM
Tris, 1 mM EDTA, pH 7.5).

Affinity-capture and denaturation of the double stranded DNA 

[0298] 10 µl aliquots of the purified amplified product were transferred to one well of a streptavidin-coated microtiter
plate (No. 1645684 Boehringer-Mannheim or No. 95029262 Labsystems). Subsequently, 10 µl incubation buffer (80
mM sodium phosphate, 400 mM NaCl, 0.4% Tween20, pH 7.5) and 30 µl water were added. After incubation for 1 hour
at room temperature the wells were washed three times with 200 µl washing buffer (40 mM Tris, 1 mM EDTA, 50 mM
NaCl, 0.1% Tween 20, pH 8.8). To denature the double stranded DNA the wells were treated with 100 µl of a 50 mM
NaOH solution for 3 min and the wells were washed three times with 200 µl washing buffer.

Oligo base extension reaction

[0299] The annealing of 25 pmol detection primer (CF508: 5'-CTATATTCATCATAGGAAACACCA-3' (SEQ ID No.
15)) was performed in 50 µl annealing buffer (20 mM Tris, 10 mM KCl, 10 mM (NH4)2SO4, 2 mM MgSO4, 1% Triton
X-100, pH 8) at 50°C for 10 min. The wells were washed three times with 200 µl washing buffer and once in 200 µl TE
buffer. The extension reaction was performed by using some components of the DNA sequencing kit from USB (No.
70770) and dNTPs or ddNTPs from Pharmacia. The total reaction volume was 45 µl, consisting of 21 µl water, 6 µl
Sequenase-buffer, 3 µl 10 mM DTT solution, 4.5 µl of 0.5 mM of three dNTPs, 4.5 µl of 2 mM of the missing one ddNTP,
5.5 µl glycerol enzyme dilution buffer, 0.25 µl Sequenase 2.0, and 0.25 µl pyrophosphatase. The reaction was pipetted
on ice and then incubated for 15 min at room temperature and for 5 min at 37°C. Thereafter, the wells were washed three
times with 200 µl washing buffer and once with 60 µl of a 70 mM NH4-Citrate solution.
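
The component volumes listed for the extension reaction can be tallied to confirm the stated 45 µl total and to estimate the final nucleotide concentrations. The sketch below is an illustration under two assumptions that the text does not spell out: that the 0.5 mM figure applies to each of the three dNTPs in the stock, and that volumes are additive. Names are hypothetical.

```python
# Component volumes (µl) of the extension reaction, as listed in the text.
VOLUMES_UL = {
    "water": 21.0,
    "Sequenase buffer": 6.0,
    "10 mM DTT": 3.0,
    "0.5 mM dNTPs (three)": 4.5,
    "2 mM ddNTP": 4.5,
    "glycerol enzyme dilution buffer": 5.5,
    "Sequenase 2.0": 0.25,
    "pyrophosphatase": 0.25,
}

total_ul = sum(VOLUMES_UL.values())


def final_conc(stock_conc: float, volume_ul: float, total: float = total_ul) -> float:
    """Final concentration after dilution into the total volume (same unit as the stock)."""
    return stock_conc * volume_ul / total


dntp_final_um = final_conc(500.0, 4.5)    # 0.5 mM stock expressed in µM
ddntp_final_um = final_conc(2000.0, 4.5)  # 2 mM stock expressed in µM

print(f"total volume: {total_ul} µl")
print(f"each dNTP: {dntp_final_um:.0f} µM, ddNTP: {ddntp_final_um:.0f} µM")
```

On these assumptions the volumes sum to the stated 45 µl, and the terminating ddNTP is present at a fourfold molar excess over each dNTP.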

Denaturation and precipitation of the extended primer

[0300] The extended primer was denatured in 50 µl 10% DMSO (dimethylsulfoxide) in water at 80°C for 10 min. For
precipitation, 10 µl NH4-Acetate (pH 6.5), 0.5 µl glycogen (10 mg/ml water, Sigma No. G1765), and 100 µl absolute
ethanol were added to the supernatant and incubated for 1 hour at room temperature. After centrifugation at 13,000 g
for 10 min the pellet was washed in 70% ethanol and resuspended in 1 µl of 18 Mohm/cm water.

Sample preparation and analysis on MALDI-TOF mass spectrometry

[0301] Sample preparation was performed by mixing 0.3 µl each of matrix solution (0.7 M 3-hydroxypicolinic acid,
0.07 M dibasic ammonium citrate in 1:1 H2O:CH3CN) and of resuspended DNA/glycogen pellet on a sample target,
which was then allowed to air dry. Up to 20 samples were spotted on a probe target disk for introduction into the source
region of an unmodified Thermo Bioanalysis (formerly Finnigan) Vision 2000 MALDI-TOF operated in reflectron mode
with 5 and 20 kV on the target and conversion dynode, respectively. Theoretical average molecular masses (Mr(calc))
were calculated from atomic compositions; reported experimental Mr (Mr(exp)) values are those of the singly-protonated
form, determined using external calibration.

RESULTS 

[0302] The aim of the experiment was to develop a fast and reliable method independent of exact stringencies for
mutation detection that leads to high quality and high throughput in the diagnosis of genetic diseases. Therefore a
special kind of DNA sequencing (oligo base extension of one mutation detection primer) was combined with the
evaluation of the resulting mini-sequencing products by matrix-assisted laser desorption ionization (MALDI) mass
spectrometry (MS). The time-of-flight (TOF) reflectron arrangement was chosen as a possible mass measurement system.
To prove this hypothesis, the examination was performed with exon 10 of the CFTR gene, in which some mutations
could lead to the clinical phenotype of cystic fibrosis, the most common monogenetic disease in the Caucasian
population.

[0303] The schematic presentation as given in Figure 34 shows the expected short sequencing products with the
theoretically calculated molecular mass of the wildtype and various mutations of exon 10 of the CFTR gene (SEQ ID
No. 132). The short sequencing products were produced using either ddTTP (Figure 34A; SEQ ID Nos. 133-135) or
ddCTP (Figure 34B; SEQ ID Nos. 136-139) to introduce a definitive sequence related stop in the nascent DNA strand.
The MALDI-TOF-MS spectra of healthy, mutation heterozygous, and mutation homozygous individuals are presented
in Figure 35. All samples were confirmed by standard Sanger sequencing, which showed no discrepancy in comparison
to the mass spec analysis. The accuracy of the experimental measurements of the various molecular masses was
within a range of minus 21.8 to plus 87.1 dalton (Da) of the expected values. This allows a definitive interpretation of
the results in each case. A further advantage of this procedure is the unambiguous detection of the ΔI507 mutation.
In the ddTTP reaction, the wildtype allele would be detected, whereas in the ddCTP reaction the three base pair deletion
would be disclosed.

[0304] The method described is highly suitable for the detection of single point mutations or microlesions of DNA.
Careful choice of the mutation detection primers will open the window of multiplexing and lead to a high throughput
including high quality in genetic diagnosis without any need for the exact stringencies necessary in comparable
allele-specific procedures. Because of the uniqueness of the genetic information, the oligo base extension of a mutation
detection primer is applicable to each disease gene or polymorphic region in the genome, like variable number of tandem
repeats (VNTR) or other single nucleotide polymorphisms (e.g., apolipoprotein E gene), as also described here.

EXAMPLE 8

Detection of Polymerase Chain Reaction Products Containing 7-Deazapurine Moieties with Matrix-Assisted
Laser Desorption/Ionization Time-of-Flight (MALDI-TOF) Mass Spectrometry

MATERIALS AND METHODS 

Nucleic acid amplifications

[0305] The following oligodeoxynucleotide primers were either synthesized according to standard phosphoamidite
chemistry (Sinha, N.D., et al., (1983) Tetrahedron Let. Vol. 24, Pp. 5843-5846; Sinha, N.D., et al., (1984) Nucleic Acids
Res., Vol. 12, Pp. 4539-4557) on a MilliGen 7500 DNA synthesizer (Millipore, Bedford, MA, USA) in 200 nmol scales
or purchased from MWG-Biotech (Ebersberg, Germany, primer 3) and Biometra (Goettingen, Germany, primers 6-7).


primer 1: 5'-GTCACCCTCGACCTGCAG (SEQ ID NO. 16);
primer 2: 5'-TTGTAAAACGACGGCCAGT (SEQ ID NO. 17);
primer 3: 5'-CTTCCACCGCGATGTTGA (SEQ ID NO. 18);
primer 4: 5'-CAGGAAACAGCTATGAC (SEQ ID NO. 19);
primer 5: 5'-GTAAAACGACGGCCAGT (SEQ ID NO. 20);
primer 6: 5'-GTCACCCTCGACCTGCAgC (g: RiboG) (SEQ ID NO. 21);
primer 7: 5'-GTTGTAAAACGAGGGCCAgT (g: RiboG) (SEQ ID NO. 22);


[0306] The 99-mer (SEQ ID No. 141) and 200-mer DNA strands (SEQ ID No. 140; modified and unmodified) as well
as the ribo- and 7-deaza-modified 100-mer were amplified from pRFc1 DNA (10 ng, generously supplied by S.
Feyerabend, University of Hamburg) in 100 µL reaction volume containing 10 mmol/L KCl, 10 mmol/L (NH4)2SO4,
20 mmol/L Tris HCl (pH 8.8), 2 mmol/L MgSO4 (exo(-) Pyrococcus furiosus (Pfu) buffer, Pharmacia, Freiburg, Germany),
0.2 mmol/L each dNTP (Pharmacia, Freiburg, Germany), 1 µmol/L of each primer and 1 unit of exo(-) Pfu DNA
polymerase (Stratagene, Heidelberg, Germany). Primers 1 and 2 were used for the 99-mer, primers 1 and 3 for the
200-mer, and primers 6 and 7 for the 100-mer. To obtain 7-deazapurine modified nucleic acids, dATP and dGTP were
replaced with 7-deaza-dATP and 7-deaza-dGTP during PCR amplification. The reaction was performed in a thermal cycler
(OmniGene, MWG-Biotech, Ebersberg, Germany) using the cycle: denaturation at 95°C for 1 min., annealing at 51°C
for 1 min. and extension at 72°C for 1 min. For all PCRs the number of reaction cycles was 30. The reaction was
allowed to extend for an additional 10 min. at 72°C after the last cycle.

[0307] The 103-mer DNA strands (modified and unmodified; SEQ ID No. 245) were amplified from M13mp18 RFI
DNA (100 ng, Pharmacia, Freiburg, Germany) in 100 µL reaction volume using primers 4 and 5; all other concentrations
were unchanged. The reaction was performed using the cycle: denaturation at 95°C for 1 min., annealing at 40°C for
1 min. and extension at 72°C for 1 min. After 30 cycles for the unmodified and 40 cycles for the modified 103-mer,
respectively, the samples were incubated for an additional 10 min. at 72°C.

Synthesis of 5'-[32P]-labeled PCR primers

[0308] Primers 1 and 4 were 5'-[32P]-labeled employing T4 polynucleotide kinase (Epicentre Technologies) and
(γ-32P)-ATP (BLU/NGG/502A, Dupont, Germany) according to the protocols of the manufacturer. The reactions were
performed substituting 10% of primer 1 and 4 in PCR with the labeled primers under otherwise unchanged reaction
conditions. The amplified DNAs were separated by gel electrophoresis on a 10% polyacrylamide gel. The appropriate
bands were excised and counted on a Packard TRI-CARB 460C liquid scintillation system (Packard, CT, USA).

Primer-cleavage from ribo-modified PCR-product

[0309] The amplified DNA was purified using Ultrafree-MC filter units (30,000 NMWL); it was then redissolved in 100
µl of 0.2 mol/L NaOH and heated at 95°C for 25 minutes. The solution was then acidified with HCl (1 mol/L) and further
purified for MALDI-TOF analysis employing Ultrafree-MC filter units (10,000 NMWL) as described below.

Purification of amplified products 

[0310] All samples were purified and concentrated using Ultrafree-MC units 30000 NMWL (Millipore, Eschborn,
Germany) according to the manufacturer's description. After lyophilization, amplified products were redissolved in 5 µL
(3 µL for the 200-mer) of ultrapure water. This analyte solution was directly used for MALDI-TOF measurements.

MALDI-TOF MS 

[0311] Aliquots of 0.5 µL of analyte solution and 0.5 µL of matrix solution (0.7 mol/L 3-HPA and 0.07 mol/L ammonium
citrate in acetonitrile/water (1:1, v/v)) were mixed on a flat metallic sample support. After drying at ambient temperature
the sample was introduced into the mass spectrometer for analysis. The MALDI-TOF mass spectrometer used was a
Finnigan MAT Vision 2000 (Finnigan MAT, Bremen, Germany). Spectra were recorded in the positive ion reflector mode
with a 5 keV ion source and 20 keV postacceleration. The instrument was equipped with a nitrogen laser (337 nm
wavelength). The vacuum of the system was 3-4 x 10^-8 hPa in the analyzer region and 1-4 x 10^-7 hPa in the source region.
Spectra of modified and unmodified DNA samples were obtained with the same relative laser power; external calibration
was performed with a mixture of synthetic oligodeoxynucleotides (7- to 50-mer).

RESULTS AND DISCUSSION 

Enzymatic synthesis of 7-deazapurine nucleotide containing nucleic acids by PCR 

[0312] In order to demonstrate the feasibility of MALDI-TOF MS for the rapid, gel-free analysis of short amplified
products and to investigate the effect of 7-deazapurine modification of nucleic acids under MALDI-TOF conditions, two
different primer-template systems were used to synthesize DNA fragments. Sequences are displayed in Figures 36
and 37. While the two single strands of the 103-mer amplified product had nearly equal masses (Δm = 8 u), the two
single strands of the 99-mer differed by 526 u. Considering that 7-deazapurine nucleotide building blocks for chemical
DNA synthesis are approximately 160 times more expensive than regular ones (Product Information, Glen Research
Corporation, Sterling, VA) and that their application in standard β-cyano-phosphoamidite chemistry is not trivial (Product
Information, Glen Research Corporation, Sterling, VA; Schneider et al. (1995) Nucl. Acids Res. 23:1570), the cost of
7-deazapurine modified primers would be very high. Therefore, to increase the applicability and scope of the method,
all PCRs were performed using unmodified oligonucleotide primers, which are routinely available. Substituting dATP
and dGTP by c7-dATP and c7-dGTP in the polymerase chain reaction led to products containing approximately 80%
7-deaza-purine modified nucleosides for the 99-mer and 103-mer, and about 90% for the 200-mer, respectively. Table II
shows the base composition of all PCR products.


TABLE II:

Base composition of the 99-mer, 103-mer and 200-mer PCR amplification products (unmodified and 7-deaza
purine modified)

DNA fragments(1)      C    T    A    G    c7-deaza-A   c7-deaza-G   rel. mod.(2)

200-mer s            54   34   56   56        -            -            -
modified 200-mer s   54   34    6    5       50           51           90%
200-mer a            56   56   34   54        -            -            -
modified 200-mer a   56   56    3    4       31           50           92%
103-mer s            28   23   24   28        -            -            -
modified 103-mer s   28   23    6    5       18           23           79%
103-mer a            28   24   23   28        -            -            -
modified 103-mer a   28   24    7    4       16           24           78%
99-mer s             34   21   24   20        -            -            -
modified 99-mer s    34   21    6    5       18           15           75%
99-mer a             20   24   21   34        -            -            -
modified 99-mer a    20   24    3    4       18           30           87%

(1) "s" and "a" describe "sense" and "antisense" strands of the double-stranded amplified product.
(2) indicates relative modification as percentage of 7-deaza purine modified nucleotides of total amount of purine nucleotides.
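
The relative modification column of Table II follows from the footnote definition: the number of 7-deaza purines divided by the total number of purines (modified plus unmodified) in the strand. A minimal sketch of that calculation, with the counts transcribed from the table and an illustrative function name, is:

```python
def relative_modification(a: int, g: int, deaza_a: int, deaza_g: int) -> int:
    """Percentage of purines that are 7-deaza-modified, rounded to the nearest integer."""
    modified = deaza_a + deaza_g
    total_purines = a + g + modified
    return round(100 * modified / total_purines)


# (A, G, c7-deaza-A, c7-deaza-G) counts for the modified strands in Table II.
MODIFIED_STRANDS = {
    "200-mer s": (6, 5, 50, 51),
    "200-mer a": (3, 4, 31, 50),
    "103-mer s": (6, 5, 18, 23),
    "103-mer a": (7, 4, 16, 24),
    "99-mer s": (6, 5, 18, 15),
    "99-mer a": (3, 4, 18, 30),
}

rel_mod = {name: relative_modification(*counts) for name, counts in MODIFIED_STRANDS.items()}

for name, pct in rel_mod.items():
    print(f"modified {name}: {pct}%")
```

The unmodified A and G remaining in each modified strand are those contributed by the unmodified primers.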

[0313] It remained to be determined whether 80-90% 7-deaza-purine modification is sufficient for accurate mass
spectrometer detection. It was therefore important to determine whether all purine nucleotides could be substituted
during the enzymatic amplification step. This was not trivial since it had been shown that c7-dATP cannot fully replace
dATP in PCR if Taq DNA polymerase is employed (Seela, F. and A. Roelling (1992) Nucleic Acids Res., 20, 55-61).
Fortunately it was found that exo(-) Pfu DNA polymerase indeed could accept c7-dATP and c7-dGTP in the absence of
unmodified purine nucleoside triphosphates. The incorporation was less efficient, leading to a lower yield of amplified
product (Figure 38).

[0314] To verify these results, the amplifications with [32P]-labeled primers were repeated. The autoradiogram (Figure
39) clearly shows lower yields for the modified PCR-products. The bands were excised from the gel and counted. For
all amplified products the yield of the modified nucleic acids was about 50% of that of the corresponding unmodified
amplification product. Further experiments showed that exo(-) Deep Vent and Vent DNA polymerase were able to
incorporate c7-dATP and c7-dGTP during PCR as well. The overall performance, however, turned out to be best for the
exo(-) Pfu DNA polymerase, giving the fewest side products during amplification. Using all three polymerases, it was
found that such PCRs employing c7-dATP and c7-dGTP instead of their isosteres showed fewer side-reactions, giving a
cleaner PCR-product. Decreased occurrence of amplification side products may be explained by a reduction of primer
mismatches due to a [...] template which is synthesized during PCR. Decreased melting points for DNA duplexes
containing 7-deaza-purine have been described (Mizusawa, S. et al., (1986) Nucleic Acids Res., 14, 1319-1324). In
addition to the three polymerases specified above (exo(-) Deep Vent DNA polymerase, Vent DNA polymerase and exo(-)
(Pfu) DNA polymerase), it is anticipated that other polymerases, such as the Large Klenow fragment of E. coli DNA
polymerase, Sequenase, Taq DNA polymerase and U AmpliTaq DNA polymerase can be used. In addition, where RNA is the
template, RNA polymerases, such as the SP6 or the T7 RNA polymerase, must be used.


MALDI-TOF mass spectrometry of modified and unmodified amplified products.

[0315] The 99-mer, 103-mer and 200-mer amplified products were analyzed by MALDI-TOF MS. Based on past
experience, it was known that the degree of depurination depends on the laser energy used for desorption and ionization
of the analyte. Since the influence of 7-deazapurine modification on fragmentation due to depurination was to be
investigated, all spectra were measured at the same relative laser energy.

[0316] Figures 40a and 40b show the mass spectra of the modified and unmodified 103-mer nucleic acids. In the case
of the unmodified 103-mer, fragmentation causes a broad (M+H)+ signal. The maximum of the peak is shifted to lower
masses, so that the assigned mass represents a mean value of the (M+H)+ signal and signals of fragmented ions, rather
than the (M+H)+ signal itself. Although the modified 103-mer still contains about 20% A and G from the oligonucleotide
primers, it shows less fragmentation, which is reflected in much narrower and more symmetric signals. In particular,
peak tailing on the lower mass side due to depurination is substantially reduced. Hence, the difference between measured
and calculated mass is strongly reduced, although the measured mass is still below the expected mass. For the
unmodified sample an (M+H)+ signal of 31670 was observed, which is a 97 u or 0.3% difference to the calculated mass,
whereas for the modified sample this mass difference diminished to 10 u or 0.03% (31713 u found, 31723 u calculated).
These observations are verified by a significant increase in mass resolution of the (M+H)+ signal of the two single
strands (m/Δm = 67 as opposed to 18 for the unmodified sample, with Δm = full width at half maximum, fwhm). Because
of the low mass difference between the two single strands (8 u), their individual signals were not resolved.
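The mass deviations and resolutions quoted above can be checked with a few lines of arithmetic. The following is a minimal sketch, not part of the original disclosure: the function names are illustrative, and the calculated mass of the unmodified 103-mer, which the text does not quote, is assumed to be the found mass plus the stated 97 u.

```python
def mass_error(found_u, calculated_u):
    """Return (deviation in u, deviation in percent of the calculated mass)."""
    deviation = calculated_u - found_u
    return deviation, 100.0 * deviation / calculated_u


def peak_width(mass_u, resolution):
    """Peak width (fwhm, in u) implied by a mass resolution m/Δm."""
    return mass_u / resolution


# Modified 103-mer: 31713 u found, 31723 u calculated (values from the text).
dev_mod, pct_mod = mass_error(31713, 31723)

# Unmodified 103-mer: 31670 u found. The calculated mass is not quoted,
# so it is assumed here to be the found mass plus the stated 97 u.
dev_unmod, pct_unmod = mass_error(31670, 31670 + 97)

# Peak widths implied by m/Δm = 67 (modified) and 18 (unmodified).
width_mod = peak_width(31713, 67)
width_unmod = peak_width(31670, 18)

print(f"modified:   {dev_mod} u ({pct_mod:.2f}%), fwhm about {width_mod:.0f} u")
print(f"unmodified: {dev_unmod} u ({pct_unmod:.1f}%), fwhm about {width_unmod:.0f} u")
```

Under these assumptions the quoted resolutions correspond to a peak that is roughly 3.7 times narrower for the modified sample, which is still far wider than the 8 u separating the two single strands.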

[0317] With the results of the 99 base pair DNA fragments, the effects of increased mass resolution for 7-deazapurine
containing DNA become even more evident. The two single strands in the unmodified sample were not resolved even
though the mass difference between the two strands of the amplified product was very high at 526 u, due to unequal
distribution of purines and pyrimidines (Figure 41a). In contrast, the modified DNA showed distinct peaks for the
two single strands (Figure 41b), which demonstrates even more clearly the superiority of this approach over gel
electrophoretic methods for the determination of molecular weights. Although baseline resolution was not obtained, the
individual masses could be assigned with an accuracy of 0.1%: Δm = 27 u for the lighter (calc. mass = 30224 u)
and Δm = 14 u for the heavier strand (calc. mass = 30750 u). Again, it was found that the full width at half maximum
was substantially decreased for the 7-deazapurine containing sample.

[0318] In the case of the 99-mer and 103-mer, the 7-deazapurine containing nucleic acids seem to give higher sensitivity
despite the fact that they still contain about 20% unmodified purine nucleotides. To get a comparable signal-to-noise
ratio at similar intensities for the (M+H)+ signals, the unmodified 99-mer required 20 laser shots in contrast to 12 for
the modified one, and the 103-mer required 12 shots for the unmodified sample as opposed to three for the
7-deazapurine nucleoside-containing amplified product.

[0319] Comparing the spectra of the modified and unmodified 200-mer amplicons, improved mass resolution was
again found for the 7-deazapurine containing sample, as well as increased signal intensities (Figures 42A and 42B).
While the signal of the single strands predominates in the spectrum of the modified sample, the DNA duplex and dimers
of the single strands gave the strongest signal for the unmodified sample.

[0320] A complete 7-deazapurine modification of nucleic acids may be achieved either by using modified primers in
PCR or by cleaving the unmodified primers from the partially modified amplified product. Since disadvantages are
associated with modified primers, as described above, a 100-mer was synthesized using primers with a ribo-modification.
The primers were cleaved hydrolytically with NaOH according to a method developed earlier in our laboratory (Koester,
H. et al., Z. Physiol. Chem., 359, 1570-1589). Figures 43A and 43B display the spectra of the amplified product before
and after primer cleavage. Figure 43B shows that the hydrolysis was successful: the hydrolyzed amplified product as
well as the two released primers could be detected, together with a small signal from residual uncleaved 100-mer. This
procedure is especially useful for the MALDI-TOF analysis of very short PCR products, since the share of unmodified
purines originating from the primer increases with decreasing length of the amplified sequence.

[0321] The remarkable properties of 7-deazapurine modified nucleic acids can be explained by more effective
desorption and/or ionization, increased ion stability and/or a lower denaturation energy of the double-stranded purine
modified nucleic acid. The exchange of the N-7 for a methine group results in the loss of one acceptor for a hydrogen
bond, which influences the ability of the nucleic acid to form secondary structures due to non-Watson-Crick base pairing
(Seela, F. and A. Kehne (1987) Biochemistry, 26, 2232-2238). In addition, the aromatic system of 7-deazapurine
has a lower electron density that weakens Watson-Crick base pairing, resulting in a decreased melting point (Mizusawa,
S. et al., (1986) Nucleic Acids Res., 14, 1319-1324) of the double strand. This effect may decrease the energy needed
for denaturation of the duplex in the MALDI process. These aspects, as well as the loss of a site which probably will
carry a positive charge on the N-7 nitrogen, render the 7-deazapurine modified nucleic acid less polar and may promote
the effectiveness of desorption.

[0322] Because of the absence of N-7 as proton acceptor and the decreased polarization of the C-N bond in
7-deazapurine nucleosides, depurination following the mechanisms established for hydrolysis in solution is prevented.
Although a direct correlation of reactions in solution and in the gas phase is problematic, less fragmentation due to
depurination of the modified nucleic acids can be expected in the MALDI process. Depurination may either be
accompanied by loss of charge, which decreases the total yield of charged species, or it may produce charged
fragmentation products, which decreases the intensity of the non-fragmented molecular ion signal.

[0323] The observation of increased sensitivity and decreased peak tailing of the (M+H)+ signals on the lower mass
side due to decreased fragmentation of the 7-deazapurine containing samples indicates that the N-7 atom indeed is
essential for the mechanism of depurination in the MALDI-TOF process. In conclusion, 7-deazapurine containing
nucleic acids show distinctly increased ion stability and sensitivity under MALDI-TOF conditions and therefore provide
for higher mass accuracy and mass resolution.


EXAMPLE 9

Solid Phase Sequencing and Mass Spectrometer Detection 

MATERIALS AND METHODS

[0324] Oligonucleotides were purchased from Operon Technologies (Alameda, CA) in an unpurified form. Sequencing
reactions were performed on a solid surface using reagents from the sequencing kit for Sequenase Version 2.0
(Amersham, Arlington Heights, Illinois).

Sequencing a 39-mer target

[0325] Sequencing complex:

SEQUENCE                                                  SEQ ID NO.
5'-TCTGGCCTGGTGCAGGGCCTATTGTAGTTGTGACGTACA-(A^b)n-3'      23
5'-TGTACGTCACAACT-3' (PNA 16/DNA)                         24

[0326] In order to perform solid-phase DNA sequencing, template strand DNA11683 was 3'-biotinylated by terminal
deoxynucleotidyl transferase. A 30 µl reaction, containing 60 pmol of DNA11683, 1.3 nmol of biotin-14-dATP (GIBCO
BRL, Grand Island, NY), 30 units of terminal transferase (Amersham, Arlington Heights, Illinois), and 1x reaction buffer
(supplied with enzyme), was incubated at 37°C for 1 hour. The reaction was stopped by heat inactivation of the terminal
transferase at 70°C for 10 min. The resulting product was desalted by passing through a TE-10 spin column (Clontech).
More than one molecule of biotin-14-dATP could be added to the 3'-end of DNA11683. The biotinylated DNA11683
was incubated with 0.3 mg of Dynal streptavidin beads in 30 µl 1x binding and washing buffer at ambient temperature
for 30 min. The beads were washed twice with TE and redissolved in 30 µl TE; a 10 µl aliquot (containing 0.1 mg of
beads) was used for sequencing reactions.

[0327] The 0.1 mg of beads from the previous step were resuspended in a 10 µl volume containing 2 µl of 5x Sequenase
buffer (200 mM Tris-HCl, pH 7.5, 100 mM MgCl2, and 250 mM NaCl) from the Sequenase kit and 5 pmol of the
corresponding primer PNA 16/DNA. The annealing mixture was heated to 70°C and allowed to cool slowly to room
temperature over a 20-30 min time period. Then 1 µl 0.1 M dithiothreitol solution, 1 µl Mn buffer (0.15 M sodium isocitrate and
0.1 M MnCl2), and 2 µl of diluted Sequenase (3.25 units) were added. The reaction mixture was divided into four aliquots
of 3 µl each and mixed with termination mixes (each consisting of 3 µl of the appropriate termination mix: 32 µM c7dATP,
32 µM dCTP, 32 µM c7dGTP, 32 µM dTTP and 3.2 µM of one of the four ddNTPs, in 50 mM NaCl). The reaction
mixtures were incubated at 37°C for 2 min. After the completion of extension, the beads were precipitated and the
supernatant was removed. The beads were washed twice and resuspended in TE and kept at 4°C.
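As a quick check on the nucleotide concentrations actually present during extension, the dilution on mixing can be computed directly. This is a sketch under the assumption that the 3 µl reaction aliquot contributes no nucleotides of its own; the function name is illustrative.

```python
def final_concentration(stock_um, mix_ul, reaction_ul):
    """Concentration (µM) after adding mix_ul of stock to reaction_ul of a
    nucleotide-free reaction aliquot (assumption: simple volume dilution)."""
    return stock_um * mix_ul / (mix_ul + reaction_ul)


# 39-mer protocol: 3 µl termination mix (32 µM dNTP, 3.2 µM ddNTP) + 3 µl aliquot.
dntp_um = final_concentration(32.0, 3.0, 3.0)
ddntp_um = final_concentration(3.2, 3.0, 3.0)
ratio = dntp_um / ddntp_um

print(f"dNTP {dntp_um:.1f} µM, ddNTP {ddntp_um:.1f} µM, ratio {ratio:.0f}:1")
```

The dNTP:ddNTP ratio is fixed by the termination mix and is unaffected by the dilution; only the absolute concentrations change.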

Sequencing a 78-mer target

[0328] Sequencing complex:

5'-AAGATCTGACCAGGGATTCGGTTAGCGTGACTGCTGCTGCTGCTGCT
GCTGCTGGATGATCCGACGCATCAGATCTGG-(A^b)n-3' (SEQ ID NO. 25)
(TNR.PLASM2)

5'-CTGATGCGTCGGATCATC-3' (CM1) (SEQ ID NO. 26)

[0329] The target TNR.PLASM2 was biotinylated and sequenced using procedures similar to those described in
the previous section (sequencing a 39-mer target).

Sequencing a 15-mer target with partially duplex probe

[0330] Sequencing complex:

5'-F-GATGATCCGACGCATCACAGCTC-3' (SEQ ID No. 27)
5'-TCGGTTCCAAGAGCTGTGATGCGTCGGATCATC-b-3' (SEQ ID No. 28)

[0331] CM1B3B was immobilized on Dynabeads M280 with streptavidin (Dynal, Norway) by incubating 60 pmol of
CM1B3B with 0.3 mg magnetic beads in 30 µl 1 M NaCl and TE (1x binding and washing buffer) at room temperature for
30 min. The beads were washed twice with TE and redissolved in 30 µl TE; a 10 or 20 µl aliquot (containing 0.1 or 0.2
mg of beads, respectively) was used for sequencing reactions.

[0332] The duplex was formed by annealing the corresponding aliquot of beads from the previous step with 10 pmol of
DFIIaSF (or 20 pmol of DFIIaSF for 0.2 mg of beads) in a 9 µl volume containing 2 µl of 5x Sequenase buffer (200
mM Tris-HCl, pH 7.5, 100 mM MgCl2, and 250 mM NaCl) from the Sequenase kit. The annealing mixture was heated
to 65°C and allowed to cool slowly to 37°C over a 20-30 min time period. The duplex primer was then mixed with 10
pmol of TS10 (20 pmol of TS10 for 0.2 mg of beads) in a 1 µl volume, and the resulting mixture was further incubated
at 37°C for 5 min and at room temperature for 5-10 min. Then 1 µl 0.1 M dithiothreitol solution, 1 µl Mn buffer (0.15 M
sodium isocitrate and 0.1 M MnCl2), and 2 µl of diluted Sequenase (3.25 units) were added. The reaction mixture was divided
into four aliquots of 3 µl each and mixed with termination mixes (each consisting of 4 µl of the appropriate termination
mix: 16 µM dATP, 16 µM dCTP, 16 µM dGTP, 16 µM dTTP and 1.6 µM of one of the four ddNTPs, in 50 mM NaCl).
The reaction mixtures were incubated at room temperature for 5 min, and at 37°C for 5 min. After the completion of
extension, the beads were precipitated and the supernatant was removed. The beads were resuspended in 20 µl TE
and kept at 4°C. An aliquot of 2 µl (out of 20 µl) from each tube was taken and mixed with 8 µl of formamide; the
resulting samples were denatured at 90-95°C for 5 min, and 2 µl (out of 10 µl total) was applied to an ALF DNA sequencer
(Pharmacia, Piscataway, NJ) using a 10% polyacrylamide gel containing 7 M urea and 0.6x TBE. The remaining aliquot
was used for MALDI-TOF MS analysis.

MALDI sample preparation and instrumentation

[0333] Before MALDI analysis, the sequencing ladder loaded magnetic beads were washed twice using 50 mM
ammonium citrate and resuspended in 0.5 µl pure water. The suspension was then loaded onto the sample target of
the mass spectrometer and 0.5 µl of saturated matrix solution (3-hydroxypicolinic acid (HPA):ammonium citrate = 10:1
mole ratio in 50% acetonitrile) was added. The mixture was allowed to dry prior to mass spectrometer analysis.

[0334] The reflectron TOFMS mass spectrometer (Vision 2000, Finnigan MAT, Bremen, Germany) was used for
analysis. 5 kV was applied in the ion source and 20 kV was applied for postacceleration. All spectra were taken in the
positive ion mode and a nitrogen laser was used. Normally, each spectrum was averaged for more than 100 shots and
a standard 25-point smoothing was applied.

RESULTS AND DISCUSSION

Conventional solid-phase sequencing

[0335] In conventional sequencing methods, a primer is directly annealed to the template and then extended and
terminated in a Sanger dideoxy sequencing reaction. Normally, a biotinylated primer is used and the sequencing ladders
are captured by streptavidin-coated magnetic beads. After washing, the products are eluted from the beads using EDTA
and formamide. Previous findings indicated that only the annealed strand of a duplex is desorbed and the immobilized
strand remains on the beads. Therefore, it is advantageous to immobilize the template and anneal the primer. After
the sequencing reaction and washing, the beads with the immobilized template and annealed sequencing ladder can
be loaded directly onto the mass spectrometer target and mixed with matrix. In MALDI, only the annealed sequencing
ladder will be desorbed and ionized, and the immobilized template will remain on the target.

[0336] A 39-mer template (SEQ ID No. 23) was first biotinylated at the 3'-end by adding biotin-14-dATP with terminal
transferase. More than one biotin-14-dATP molecule could be added by the enzyme. Since the template was immobilized
and remained on the beads during MALDI, the number of biotin-14-dATP molecules would not affect the mass spectra. A
14-mer primer (SEQ ID No. 24) was used for the solid-state sequencing to generate DNA fragments 3-27 below (SEQ
ID Nos. 142-166). MALDI-TOF mass spectra of the four sequencing ladders are shown in Figure 44 and the expected
theoretical values are shown in Table III.

TABLE III

No.   Sequence
1     5'-TCTGGCCTGGTGCAGGGCCTATTGTAGTTGTGACGTACA-(A^b)n-3'
2     3'-TCAACACTGCATGT-5'
3     3'-ATCAACACTGCATGT-5'
4     3'-CATCAACACTGCATGT-5'
5     3'-ACATCAACACTGCATGT-5'
6     3'-AACATCAACACTGCATGT-5'
7     3'-TAACATCAACACTGCATGT-5'
8     3'-ATAACATCAACACTGCATGT-5'
9     3'-GATAACATCAACACTGCATGT-5'
10    3'-GGATAACATCAACACTGCATGT-5'
11    3'-CGGATAACATCAACACTGCATGT-5'
12    3'-CCGGATAACATCAACACTGCATGT-5'
13    3'-CCCGGATAACATCAACACTGCATGT-5'
14    3'-TCCCGGATAACATCAACACTGCATGT-5'
15    3'-GTCCCGGATAACATCAACACTGCATGT-5'
16    3'-CGTCCCGGATAACATCAACACTGCATGT-5'
17    3'-ACGTCCCGGATAACATCAACACTGCATGT-5'
18    3'-CACGTCCCGGATAACATCAACACTGCATGT-5'
19    3'-CCACGTCCCGGATAACATCAACACTGCATGT-5'
20    3'-ACCACGTCCCGGATAACATCAACACTGCATGT-5'
21    3'-GACCACGTCCCGGATAACATCAACACTGCATGT-5'
22    3'-GGACCACGTCCCGGATAACATCAACACTGCATGT-5'
23    3'-CGGACCACGTCCCGGATAACATCAACACTGCATGT-5'
24    3'-CCGGACCACGTCCCGGATAACATCAACACTGCATGT-5'
25    3'-ACCGGACCACGTCCCGGATAACATCAACACTGCATGT-5'
26    3'-GACCGGACCACGTCCCGGATAACATCAACACTGCATGT-5'
27    3'-AGACCGGACCACGTCCCGGATAACATCAACACTGCATGT-5'

TABLE III (Continued)

No.   A-reaction   C-reaction   G-reaction   T-reaction
1.    -            -            -            -
2.    4223.8       4223.8       4223.8       4223.8
3.    4521.1       -            -            -
4.    -            4809.2       -            -
5.    5133.4       -            -            -
6.    5434.6       -            -            -
7.    -            -            -            5737.8
8.    6051.1       -            -            -
9.    -            -            6379.2       -
10.   -            -            6704.4       -
11.   -            6995.6       -            -
12.   -            7284.8       -            -
13.   -            7574.0       -            -
14.   -            -            -            7878.2
15.   -            -            8207.4       -
16.   -            8495.6       -            -
17.   8808.8       -            -            -
18.   -            9097.0       -            -
19.   -            9386.2       -            -
20.   9699.4       -            -            -
21.   -            -            10027.6      -
22.   -            -            10355.3      -
23.   -            10644.0      -            -
24.   -            10933.2      -            -
25.   11246.4      -            -            -
26.   -            -            11574.6      -
27.   11886.8      -            -            -

[0337] The sequencing reaction produced a relatively homogeneous ladder, and the full-length sequence was
determined easily. One peak around 5150, which appeared in all reactions, was not identified. A possible explanation is
that a small portion of the template formed some kind of secondary structure, such as a loop, which hindered Sequenase
extension. Mis-incorporation is of minor importance, since the intensity of these peaks was much lower than that of the
sequencing ladders. Although 7-deazapurines were used in the sequencing reaction, which could stabilize the N-glycosidic
bond and prevent depurination, minor base losses were still observed since the primer was not substituted by
7-deazapurines. The full-length ladder, with a ddA at the 3' end, appeared in the A reaction with an apparent mass of
11899.8. A more intense peak of 12333 appeared in all four reactions and is likely due to an addition of an extra
nucleotide by the Sequenase enzyme.
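The expected ladder of Table III follows mechanically from the template and primer sequences: the extension products are prefixes of the reverse complement of the template, and each product appears in the reaction whose ddNTP matches its 3'-terminal base. The sketch below illustrates this; the function and variable names are illustrative and not part of the original disclosure.

```python
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}

TEMPLATE = "TCTGGCCTGGTGCAGGGCCTATTGTAGTTGTGACGTACA"  # SEQ ID No. 23, 5'->3'
PRIMER = "TGTACGTCACAACT"                             # SEQ ID No. 24, 5'->3'


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (5'->3')."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def sequencing_ladder(template, primer):
    """List (product 5'->3', terminating base) for every extension step.

    Assumes the primer anneals flush with the 3' end of the template.
    """
    full_length = reverse_complement(template)
    if not full_length.startswith(primer):
        raise ValueError("primer does not match the 3' end of the template")
    return [
        (full_length[:n], full_length[n - 1])
        for n in range(len(primer) + 1, len(full_length) + 1)
    ]


ladder = sequencing_ladder(TEMPLATE, PRIMER)

# Product lengths expected in each of the four termination reactions.
by_reaction = {
    base: [len(product) for product, last in ladder if last == base]
    for base in "ACGT"
}

for product, last in ladder[:3]:
    # Printed 3'->5' to match the orientation used in Table III.
    print(f"dd{last}-reaction: 3'-{product[::-1]}-5'")
```

Each entry of `ladder` corresponds to one of fragments 3-27 in Table III, and `by_reaction` groups the product lengths by the column in which their theoretical masses are listed.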

[0338] The same technique could be used to sequence longer DNA fragments. A 78-mer template containing a CTG
repeat (SEQ ID No. 25) was 3'-biotinylated by adding biotin-14-dATP with terminal transferase. An 18-mer primer (SEQ
ID No. 26) was annealed right outside the CTG repeat so that the repeat could be sequenced immediately after primer
extension. The four reactions were washed and analyzed by MALDI-TOF MS as usual. An example of the G-reaction
is shown in Figure 45 (SEQ ID Nos. 167-220) and the expected sequencing ladder is shown in Table IV with theoretical
mass values for each ladder component. All sequencing peaks were well resolved except the last component
(theoretical value 20577.4), which was indistinguishable from the background. Two neighboring sequencing peaks (a
62-mer and a 63-mer) were also separated, indicating that such sequencing analysis could be applicable to longer
templates. Again, an addition of an extra nucleotide by the Sequenase enzyme was observed in this spectrum. This
addition is not template specific and appeared in all four reactions, which makes it easy to identify. Compared to the
primer peak, the sequencing peaks were at much lower intensity in the long template case.

TABLE IV (Continued)

No.   ddATP        ddCTP        ddGTP        ddTTP
1.    5491.6       5491.6       5491.6       5491.6
2.    -            5764.8       -            -
3.    6078.0       -            -            -
4.    -            -            6407.2       -
5.    -            6696.4       -            -
6.    7009.6       -            -            -
7.    -            -            7338.8       -
8.    -            7628.0       -            -
9.    7941.2       -            -            -
10.   -            -            8270.4       -
11.   -            8559.6       -            -
12.   8872.8       -            -            -
13.   -            -            9202.0       -
14.   -            9491.2       -            -
15.   9804.4       -            -            -
16.   -            -            10133.6      -
17.   -            10422.8      -            -
18.   10736.0      -            -            -
19.   -            -            11065.2      -
20.   -            11354.4      -            -
21.   11667.6      -            -            -
22.   -            -            11996.8      -
23.   -            12286.0      -            -
24.   12599.2      -            -            -
25.   -            -            12928.4      -
26.   -            -            -            13232.6
27.   -            13521.8      -            -
28.   13835.0      -            -            -
29.   -            14124.2      -            -
30.   -            -            14453.4      -
31.   -            14742.6      -            -
32.   -            -            -            15046.8
33.   15360.0      -            -            -
34.   15673.2      -            -            -
35.   -            15962.4      -            -
36.   -            16251.6      -            -
37.   -            -            16580.8      -
38.   16894.0      -            -            -
39.   17207.2      -            -            -
40.   -            -            -            17511.4
41.   -            17800.6      -            -
42.   -            18189.8      -            -
43.   -            18379.0      -            -
44.   -            -            -            18683.2
45.   -            -            19012.4      -
46.   -            -            19341.6      -
47.   -            -            -            19645.8
48.   -            19935.0      -            -
49.   20248.2      -            -            -
50.   -            -            20577.4      -
51.   20890.6      -            -            -
52.   -            -            -            21194.4
53.   -            21484.0      -            -
54.   -            -            -            21788.2
55.   -            -            -            22092.4
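Reading a sequence from such a ladder amounts to matching the mass difference between successive components against the per-base increments. The sketch below assumes increments of 313.2 (A), 289.2 (C), 329.2 (G) and 304.2 (T), which are inferred from the differences between successive rows of Table IV rather than stated in the text. The function name and tolerance are illustrative.

```python
# Per-base mass increments (u), inferred from successive rows of Table IV.
INCREMENTS = {"A": 313.2, "C": 289.2, "G": 329.2, "T": 304.2}


def call_bases(masses, tolerance=0.5):
    """Assign a base to each step between successive ladder masses.

    Returns '?' for a step whose mass difference matches no increment
    within the given tolerance.
    """
    calls = []
    for lighter, heavier in zip(masses, masses[1:]):
        diff = heavier - lighter
        base = min(INCREMENTS, key=lambda b: abs(INCREMENTS[b] - diff))
        calls.append(base if abs(INCREMENTS[base] - diff) <= tolerance else "?")
    return "".join(calls)


# Theoretical values for rows 2-10 of Table IV. Row 1 (the primer) is left
# out because its step to row 2 does not follow the same increments.
rows_2_to_10 = [5764.8, 6078.0, 6407.2, 6696.4, 7009.6,
                7338.8, 7628.0, 7941.2, 8270.4]
repeat_calls = call_bases(rows_2_to_10)

# Rows 41-43 as printed in Table IV.
rows_41_to_43 = [17800.6, 18189.8, 18379.0]
check_calls = call_bases(rows_41_to_43)

print(repeat_calls, check_calls)
```

For rows 2-10 the calls reproduce the complement of the CTG repeat. Rows 41-43 as printed do not match any increment, which suggests that the listed value for row 42 may contain a misprint; a value of 18089.8 would fit two successive C additions.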


Sequencing using duplex DNA probes for capturing and priming

[0339] Duplex DNA probes with a single-stranded overhang have been demonstrated to be able to capture specific
DNA templates and also serve as primers for solid-state sequencing. The scheme is shown in Figure 46. Stacking
interactions between a duplex probe and a single-stranded template allow only a 5-base overhang to be sufficient for
capturing. Based on this format, a 5' fluorescent-labeled 23-mer (5'-GAT GAT CCG ACG CAT CAC AGC TC-3') (SEQ
ID No. 29) was annealed to a 3'-biotinylated 18-mer (5'-GTG ATG CGT CGG ATC ATC-3') (SEQ ID No. 30), leaving a
5-base overhang. A 15-mer template (5'-TCG GTT CCA AGA GCT-3') (SEQ ID No. 31) was captured by the duplex
and sequencing reactions were performed by extension of the 5-base overhang. MALDI-TOF mass spectra of the
reactions are shown in Figure 47A-D. All sequencing peaks were resolved, although at relatively low intensities. The
last peak in each reaction is due to unspecific addition of one nucleotide to the full-length extension product by the
Sequenase enzyme. For comparison, the same products were run on a conventional DNA sequencer and a stacking
fluorogram of the results is shown in Figure 48. As can be seen from the Figure, the mass spectra had the same pattern
as the fluorogram, with sequencing peaks at much lower intensity compared to the 23-mer primer.

EXAMPLE 10

Thermo Sequenase Cycle Sequencing

MATERIALS AND METHODS

[0340] PCR amplification. Human leukocytic genomic DNA was used for PCR amplification. PCR primers to amplify
a 209 bp fragment of the β-globin gene were the β2 forward primer (5'-CAT TTG GTT GTG AGA GAA GTG-3' SEQ ID
NO. 32) and the β11 reverse primer (5'-CTT GTG TGT GTG GAG ATG G-3' SEQ ID NO. 33). Taq polymerase and 10x
buffer were purchased from Boehringer-Mannheim (Germany) and dNTPs from Pharmacia (Freiburg, Germany). The
total reaction volume was 50 µl including 8 pmol of each primer with approximately 200 ng of genomic DNA used as
template and a final dNTP concentration of 200 µM. PCR conditions were: 5 min at 94°C, followed by 40 cycles of 30
sec at 94°C, 45 sec at 53°C, 30 sec at 72°C, and a final extension time of 2 min at 72°C. The generated amplified
product was purified and concentrated (2x) with the Qiagen 'Qiaquick' PCR purification kit (#28106) and stored in H2O.

[0341] Cycle Sequencing. Sequencing ladders were generated by primer extension with Thermo Sequenase™ DNA
Polymerase (Amersham LIFE Science, #E79000Y) under the following conditions: 7 pmol of HPLC purified primer
(Cod5 12mer: 5'-TGC ACC TGA CTC-3' SEQ ID No. 34) were added to 6 µl purified and concentrated amplified product
(i.e. 12 µl of the original amplified product), 2.5 units Thermo Sequenase and 2.5 µl Thermo Sequenase reaction buffer
in a total volume of 25 µl. The final nucleotide concentrations were 30 µM of the appropriate ddNTP (ddATP, ddCTP,
ddGTP or ddTTP; Pharmacia Biotech, #27-2045-01) and 210 µM of each dNTP (7-deaza-dATP, dCTP, 7-deaza-dGTP,
dTTP; Pharmacia Biotech).

[0342] Cycling conditions were: denaturation for 4 min at 94°C, followed by 35 cycles of 30 sec at 94°C, 30 sec at
38°C, 30 sec at 55°C, and a final extension of 2 min at 72°C.

[0343] Sample preparation and analysis by MALDI-TOF MS. After completion of the cycling program, the reaction
volume was increased to 50 µl by addition of 25 µl H2O. Desalting was achieved by shaking 30 µl of ammonium saturated
DOWEX (Fluka #44485) cation exchange beads with 50 µl of the analyte for 2 min at room temperature. The Dowex
beads, purchased in the protonated form, were pre-treated with 2 M NH4OH to convert them to the ammonium form,
then washed with H2O until the supernatant was neutral, and finally put in 10 mM ammonium citrate for usage. After
the cation exchange, DNA was purified and concentrated by ethanol precipitation by adding 5 µl 3 M ammonium acetate
(pH 6.5), 0.5 µl glycogen (10 mg/ml, Sigma), and 110 µl absolute ethanol to the analyte and incubating at room
temperature for 1 hour. After 12 min centrifugation at 20,000 x g the pellet was washed in 70% ethanol and resuspended
in 1 µl 18 Mohm/cm H2O.

[0344] For MALDI-TOF MS analysis, 0.35 µl of resuspended DNA was mixed with 0.35-1.3 µl matrix solution (0.7 M
3-hydroxypicolinic acid (3-HPA), 0.07 M ammonium citrate in 1:1 H2O:CH3CN) on a stainless steel sample target disk
and allowed to air dry preceding spectrum acquisition using a Thermo Bioanalysis Vision 2000 MALDI-TOF operated
in reflectron mode with 5 and 20 kV on the target and conversion dynode, respectively. External calibration generated
from eight peaks (3000-18000 Da) was used for all spectra.

RESULTS

[0345] FIGURE 49 shows a MALDI-TOF mass spectrum of the sequencing ladder generated from a biological
amplified product as template and a 12mer (5'-TGC ACC TGA CTC-3' (SEQ ID NO. 34)) sequencing primer. The peaks
resulting from depurinations and peaks which are not related to the sequence are marked by an asterisk. MALDI-TOF
MS measurements were taken on a reflectron TOF MS. A.) Sequencing ladder stopped with ddATP; B.) Sequencing
ladder stopped with ddCTP; C.) Sequencing ladder stopped with ddGTP; D.) Sequencing ladder stopped with ddTTP.
[0346] FIGURE 50 shows a schematic representation of the sequencing ladder generated in Fig. 49 with the
corresponding calculated molecular masses up to 40 bases after the primer (SEQ ID Nos 221-260). For the calculation the
following masses were used: 3581.4 Da for the primer, 312.2 Da for 7-deaza-dATP, 304.2 Da for dTTP, 289.2 Da for
dCTP and 328.2 Da for 7-deaza-dGTP.
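The primer mass quoted for Figure 50 can be reproduced from the base composition of the 12mer. The sketch below is illustrative only: the residue masses for unmodified dA and dG (313.2 and 329.2 Da) and the end correction of -61.96 Da (one HPO3 removed, one H2O added for a 5'-OH/3'-OH oligonucleotide) are standard values assumed here, not figures given in the text. The three-base extension used in the example is likewise purely illustrative, and the dideoxy terminus of a real ladder component is not modeled.

```python
# Average residue masses (Da) of unmodified nucleotides in a DNA chain.
# The A and G values are assumptions; C and T match the text.
RESIDUE = {"A": 313.2, "C": 289.2, "G": 329.2, "T": 304.2}

# Per-nucleotide masses quoted in the text for the extension products,
# with 7-deaza-dA and 7-deaza-dG in place of dA and dG.
DEAZA = {"A": 312.2, "C": 289.2, "G": 328.2, "T": 304.2}

# Assumed end correction: remove one HPO3 (79.98), add one H2O (18.02).
END_CORRECTION = -61.96


def oligo_mass(seq):
    """Average mass (Da) of an unmodified oligonucleotide with free ends."""
    return sum(RESIDUE[base] for base in seq) + END_CORRECTION


def extended_mass(primer_mass, extension):
    """Primer mass plus 7-deaza-substituted extension (no dideoxy terminus)."""
    return primer_mass + sum(DEAZA[base] for base in extension)


primer_mass = oligo_mass("TGCACCTGACTC")          # 12mer, SEQ ID NO. 34
example_mass = extended_mass(3581.4, "CTG")       # hypothetical 3-base extension

print(f"primer: {primer_mass:.1f} Da, primer + CTG: {example_mass:.1f} Da")
```

Under these assumptions the composition of 5'-TGC ACC TGA CTC-3' agrees with the 3581.4 Da used for the primer.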

[0347] FIGURE 51 shows the sequence of the amplified 209 bp product within the β-globin gene (SEQ ID
No. 261), which was used as a template for sequencing. The sequences of the appropriate PCR primers and the location
of the 12mer sequencing primer are also shown. This sequence represents a homozygote mutant at position 4 after
the primer. In a wildtype sequence this T would be replaced by an A.

EXAMPLE 11

Microsatellite Analysis Using Primer Oligo Base Extension (PROBE) and MALDI-TOF Mass Spectrometry

SUMMARY

[0348] The method uses a single detection primer followed by an oligonucleotide extension step to give products
differing in length by a number of bases specific for the number of repeat units or for second site mutations within the
repeated region, which can be easily resolved by MALDI-TOF mass spectrometry. The method is demonstrated using
as a model system the AluVpA polymorphism in intron 5 of the interferon-α receptor gene located on human
chromosome 21, and the poly T tract of the splice acceptor site of intron 8 from the CFTR gene located on human chromosome
7.

MATERIALS AND METHODS

[0349] Genomic DNA was obtained from 18 unrelated individuals and one family consisting of a mother, father, and
three children. The repeated region was evaluated conventionally by denaturing gel electrophoresis and the results
obtained were confirmed by standard Sanger sequencing.

[0350] The primers for PCR amplification (Spmol each) were IFNAR-IVS5-5': (5'-TGG TTA CTT AAC CCA GTG TG-3'
SEQ ID NO. 35) and IFNAR-IVS5-3'.2: (5'-GAG ACT ATG TAA TAG TAT GC-3' SEQ ID NO. 36) for a part of
intron 5 of the interferon-α receptor gene, and CFEx9-F: (5'-GAA AAT ATG TGA CAA ACT CAT C-3' SEQ ID NO. 37)
(5'-biotinylated) and CFEx9-R: (5'-GAT GGA CAC CAA ATT AAG TTC-3' SEQ ID NO. 38) for CFTR exon 9 with flanking
intron sequences of the CFTR gene. Taq polymerase including 10X buffer was purchased from Boehringer-Mannheim
and dNTPs were obtained from Pharmacia. The total reaction volume was 50 µl. PCR conditions were 5 min at 94°C
followed by 40 cycles of: 1 min at 94°C, 45 sec at 53°C, and 30 sec at 72°C, and a final extension time of 5 min at 72°C.

[0351] Amplification products were purified using Qiagen's PCR purification kit (No. 28106) according to the
manufacturer's instructions. Purified products were eluted from the column in 50 µl TE buffer (10 mM Tris-HCl, 1 mM EDTA,
pH 7.5).

A) Primer oligo base extension reaction (thermo cycling method)

[0352] CyclePROBE was performed with 5 pmol of the appropriate detection primer (IFN: 5'-TGA GAC TCT GTG TG-3'
SEQ ID NO. 39) in a total volume of 25 µl including 1 pmol purified template, 2 units Thermosequenase (Amersham Life
Science, Cat. #E79000Y), 2.5 µl Thermosequenase buffer, 25 µmol of each deoxynucleotide (7-deaza-dATP, dTTP,
and in some experiments extra dCTP) and 100 µmol of dideoxyguanine and in some experiments additional ddCTP.
Cycling conditions: initial denaturation at 94°C for 5 min followed by 30 cycles with 44°C annealing temperature for 30
sec and 55°C extension temperature for 1 min.

Primer oligo base extension reaction (isothermal method)

[0353] 10 µl aliquots of the purified double-stranded amplified product (~3 pmol) were transferred to a
streptavidin-coated microtiter plate well (~16 pmol capacity per 50 µl volume; No. 1645684 Boehringer-Mannheim), followed by
addition of 10 µl incubation buffer (80 mM sodium phosphate, 400 mM NaCl, 0.4% Tween 20, pH 7.5) and 30 µl water.
After incubation for 1 hour at room temperature, the wells were washed three times with 200 µl washing buffer A (40
mM Tris, 1 mM EDTA, 50 mM NaCl, 0.1% Tween 20, pH 8.8) and incubated with 100 µl of 50 mM NaOH for 3 min to
denature the double-stranded DNA. Finally, the wells were washed three times with 200 µl 70 mM ammonium citrate
solution.

[0354] The annealing of 100 pmol detection primer (CFpT: 5'-TTG CCC AAA TCC GTG-3' SEQ ID NO. 40) was
performed in 50 µl annealing buffer (50 mM ammonium phosphate buffer, pH 7.0 and 100 mM ammonium chloride) at
65°C for 2 min, at 37°C for 10 min, and at room temperature for 10 min. The wells were washed three times with 200
µl washing buffer B (40 mM Tris, 1 mM EDTA, 50 mM NH4Cl, 0.1% Tween 20, pH 8.8) and once in 200 µl TE buffer.
The extension reaction was performed using some components of the DNA sequencing kit from USB (No. 70770) and
dNTPs or ddNTPs from Pharmacia. Total reaction volume was 45 µl, consisting of 21 µl water, 6 µl Sequenase buffer,
3 µl 100 mM DTT solution, 50 µmol of 7-deaza-dATP, 20 µmol ddCTP, 5.5 µl glycerol enzyme dilution buffer, 0.25 µl
Sequenase 2.0, and 0.25 µl pyrophosphatase. The reaction was pipetted on ice and incubated for 15 min at room
temperature and for 5 min at 37°C. Finally, the wells were washed three times with 200 µl washing buffer B.

[0355] The extended primer was denatured from the template strand by heating at SO^C for 10 min in 50 µl of a 50
mM ammonium hydroxide solution.

[0356] For precipitation, 10 µl 3 M NH4-acetate (pH 6.5), 0.5 µl glycogen (10 mg/ml water, Sigma, Cat. #G1765),
and 110 µl absolute ethanol were added to the supernatant and incubated for 1 hour at room temperature. After
centrifugation at 13,000 g for 10 min the pellet was washed in 70% ethanol and resuspended in 1 µl 18 Mohm/cm H2O.

[0357] Sample preparation was performed by mixing 0.6 µl of matrix solution (0.7 M 3-hydroxypicolinic acid, 0.07 M
dibasic ammonium citrate in 1:1 H2O:CH3CN) with 0.3 µl of resuspended DNA/glycogen pellet on a sample target and
allowing it to air dry. Up to 20 samples were spotted on a probe target disk for introduction into the source region of a
Thermo Bioanalysis (formerly Finnigan) Vision 2000 MALDI-TOF operated in reflectron mode with 5 and 20 kV on
the target and conversion dynode, respectively. Theoretical average molecular masses (Mr(calc)) were calculated from
atomic compositions; reported experimental (Mr(exp)) values are those of the singly-protonated form, determined
using external calibration.

RESULTS 

[0358] The aim of the experiments was to develop a fast and reliable method for the exact determination of the number of repeat units in microsatellites or the length of a mononucleotide stretch, including the potential to detect second site mutations within the polymorphic region. Therefore, a special kind of DNA sequencing (primer oligo base extension, PROBE) was combined with the evaluation of the resulting products by matrix-assisted laser desorption ionization (MALDI) mass spectrometry (MS). The time-of-flight (TOF) reflectron arrangement was chosen as a possible mass measurement system. As an initial feasibility study, an examination was performed first on an AluVpA repeat polymorphism located in intron 5 of the human interferon-α receptor gene (cyclePROBE reaction) and second on the poly T tract located in intron 8 of the human CFTR gene (isothermal PROBE reaction).

[0359] A schematic presentation of the cyclePROBE experiment for the AluVpA repeat polymorphism is given in Figure 52. The extension of the antisense strand (SEQ ID No. 262) was performed with the sense strand serving as the template. The detection primer is underlined. In a family study, co-dominant segregation of the various alleles could be demonstrated by the electrophoretic procedure as well as by the cyclePROBE method followed by mass spectrometric analysis (Figure 53). Those alleles of the mother and child 2, for which direct electrophoresis of the amplified product indicated one of the two copies to have 13 repeat units, were measured using cyclePROBE to have instead only 11 units using ddG as terminator. The replacement of ddG by ddC resulted in a further unexpected short allele with a molecular mass of approximately 11650 in the DNA of the mother and child 2 (Figure 54). Sequence analysis verified the presence of two second site mutations in the allele with 13 repeat units. The first is a C to T transition in the third repeat unit and the second mutation is a T to G transversion in the ninth repeat unit. Examination of 28 unrelated individuals shows that the 13 unit allele is split into a normal allele and a truncated allele using cyclePROBE. Statistical evaluation shows that the polymorphism is in Hardy-Weinberg equilibrium for both methods; however, using cyclePROBE as detection method the polymorphism information content is increased to 0.734.
[0360] PROBE was also used as an isothermal method for the detection of the three common alleles at the intron 8 splice acceptor site of the CFTR gene (SEQ ID No. 263). Figure 55 shows a schematic presentation of the expected diagnostic products (SEQ ID Nos. 264-266) with the theoretical mass values. The reaction was also performed in the antisense direction.

[0361] Figure 56 demonstrates that all three common alleles (T5, T7, and T9) at this locus could be reliably disclosed by this method. Reference to Figure 56 indicates that mass accuracy and precision with the reflectron time-of-flight instrument used in this study ranged from 0-0.4%, with a relative standard deviation of 0.13%. This corresponds to far better than single base accuracy for the <90-mer diagnostic products generated in the IFNAR system. Such high analytical sensitivity is sufficient to detect single or multiple insertion/deletion mutations within the repeat unit or its flanking regions, which would induce >1% mass shifts in a 90-mer. This is analogous to the Figure 56 polyT tract analysis. Other mutations (i.e. an A to T or a T to A mutation within the IFNAR gene AST repeat) which do not cause premature product termination are not detectable using any dNTP/ddNTP combination with PROBE and low performance MS instrumentation; a 9 Da shift in a 90-mer corresponds to a 0.03% mass shift. Achieving the accuracy and precision required to detect such minor mass shifts has been demonstrated with higher performance instrumentation such as Fourier transform (FT) MS, for which single Da accuracy is obtained up to 100-mers. Further, tandem FTMS, in which a mass shifted fragment can be isolated within the instrument and dissociated to generate sequence specific fragments, has been demonstrated to locate point mutations to the base in comparably sized products. Thus the combination of PROBE with higher performance instrumentation will have an analytical sensitivity which can be matched only by cumbersome full sequencing of the repeat region.
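The percentages quoted in this paragraph follow from simple arithmetic. The sketch below assumes a mean nucleotide residue mass of about 309 Da (an illustrative assumption, not a value given in the text) and estimates the relative mass shift that a given change causes in an oligonucleotide of a given length. The comparison against the instrument's relative standard deviation is a deliberately crude criterion.

```python
# Rough estimate of the relative mass shift caused by a sequence change.
# AVG_RESIDUE_MASS_DA is an assumed mean nucleotide residue mass, used
# only to approximate the mass of an oligonucleotide from its length.
AVG_RESIDUE_MASS_DA = 309.0


def relative_shift_percent(shift_da, n_bases, avg_residue_mass=AVG_RESIDUE_MASS_DA):
    """Return |shift| as a percentage of the approximate oligonucleotide mass."""
    oligo_mass = n_bases * avg_residue_mass
    return 100.0 * abs(shift_da) / oligo_mass


def is_detectable(shift_da, n_bases, instrument_rsd_percent):
    """Crude criterion: the shift must exceed the instrument's relative SD."""
    return relative_shift_percent(shift_da, n_bases) > instrument_rsd_percent


if __name__ == "__main__":
    # A to T (or T to A) substitution: about 9 Da in a 90-mer.
    print(f"{relative_shift_percent(9, 90):.3f} %")
    # Insertion or deletion of one nucleotide in a 90-mer.
    print(f"{relative_shift_percent(AVG_RESIDUE_MASS_DA, 90):.2f} %")
```

With the 0.13% relative standard deviation reported for the reflectron instrument, `is_detectable(9, 90, 0.13)` is false while a single-base insertion or deletion is comfortably above the threshold, which is the distinction the paragraph draws.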

EXAMPLE 12 

Improved Apolipoprotein E Genotyping Using Primer Oligo Base Extension (PROBE) and MALDI-TOF Mass Spectrometry

MATERIALS AND METHODS 

PCR amplification. 


[0362] Human leukocytic genomic DNA from 100 anonymous individuals from a previously published study (Braun, A et al., (1992) Human Genet. 89:401-406) was screened for apolipoprotein E genotypes using conventional methods. PCR primers to amplify a portion of exon 4 of the apo E gene were delineated according to the published sequence (Das, HK et al., (1985) J. Biol. Chem. 260:6240-6247) (forward primer, apoE-F: 5'-GGC ACG GCT GTG CAA GGA G-3' SEQ ID NO. 41; reverse, apoE-R: 5'-AGG CCG GGG TGG GCG CGG TG-3' SEQ ID NO. 42). Taq polymerase and 10x buffer were purchased from Boehringer-Mannheim (Germany) and dNTPs from Pharmacia (Freiburg, Germany). The total reaction volume was 50 µL including 8 pmol of each primer and 10% DMSO (dimethylsulfoxide, Sigma) with approximately 200 ng of genomic DNA used as template. Solutions were heated to 80°C before the addition of 1 U polymerase; PCR conditions were: 2 min at 94°C, followed by 40 cycles of 30 sec at 94°C, 45 sec at 63°C, 30 sec at 72°C, and a final extension time of 2 min at 72°C.

Restriction enzyme digestion and polyacrylamide electrophoresis.

[0363] CfoI, RsaI, and reaction buffer L were purchased from Boehringer-Mannheim, and HhaI from Pharmacia (Freiburg, Germany). For CfoI alone and simultaneous CfoI/RsaI digestion, 20 µL of amplified products were diluted with 15 µL water and 4 µL Boehringer-Mannheim buffer L; after addition of 10 units of the appropriate restriction enzyme(s) the samples were incubated for 60 min at 37°C. The procedure for simultaneous HhaI/RsaI digestion required first digestion by RsaI in buffer L for one hour, followed by addition of NaCl (50 mM end concentration) and HhaI, and additional incubation for one hour. 20 µL of the restriction digest were analyzed on a 12% polyacrylamide gel as described elsewhere (Hixson (1990) J. Lipid Res. 31:545-548). Recognition sequences of RsaI and CfoI (HhaI) are GT/AC and GCG/C, respectively; masses of expected digestion fragments from the 252-mer amplified product with CfoI alone and the simultaneous double digest with CfoI (or HhaI) and RsaI are given in Table V.
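As an illustration of how expected digest fragments can be derived from a sequence, the following minimal sketch locates the recognition sequences on one strand and splits the strand at the cut positions. It assumes the recognition sequences are read as GCG/C for CfoI (HhaI) and GT/AC for RsaI, with "/" marking the cut on the strand as written; the test sequence is a made-up toy string, not the apo E amplified product, and only one strand is considered.

```python
# Toy in-silico digest on a single strand (5' to 3').
# Each enzyme maps to (recognition sequence, offset of the cut within it).
ENZYMES = {
    "CfoI": ("GCGC", 3),  # GCG/C (HhaI recognizes the same sequence)
    "RsaI": ("GTAC", 2),  # GT/AC
}


def cut_positions(seq, site, offset):
    """Return 0-based cut positions (the cut falls just before this index)."""
    seq = seq.upper()
    cuts = []
    start = seq.find(site)
    while start != -1:
        cuts.append(start + offset)
        start = seq.find(site, start + 1)  # allow overlapping sites
    return cuts


def digest(seq, enzyme_names):
    """Split seq at every cut made by the named enzymes; return the fragments."""
    seq = seq.upper()
    cuts = set()
    for name in enzyme_names:
        site, offset = ENZYMES[name]
        cuts.update(cut_positions(seq, site, offset))
    bounds = [0] + sorted(cuts) + [len(seq)]
    return [seq[a:b] for a, b in zip(bounds, bounds[1:])]


if __name__ == "__main__":
    toy = "AAGCGCTTGTACCC"  # hypothetical sequence for illustration only
    print(digest(toy, ["CfoI"]))
    print(digest(toy, ["CfoI", "RsaI"]))
```

Because both recognition sequences are palindromic, the complementary strand is cut at the mirrored position; computing its fragments would require the reverse complement, which this sketch leaves out.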

Thermo-PROBE. 


[0364] PCR amplification was performed as described above, but with products purified with the Qiagen Qiaquick kit to remove unincorporated primers. Multiplex Thermo-PROBE was performed with 35 µl amplified product and 8 pmol each of the codon 112 (5'-GCG GAC ATG GAG GAC GTG-3' SEQ ID NO. 43) and 158 (5'-GAT GCC GAT GAC CTG CAG AAG-3' SEQ ID NO. 44) detection primers in 20 µl including ~1 pmol purified biotinylated antisense template immobilized on streptavidin coated magnetic beads, 2.5 units Thermosequenase, 2 µl Thermosequenase buffer, 50 µM of each dNTP and 200 µM of ddXTP, with the base identity of N and X as described in the text. Cycling conditions were: denaturation (94°C, 30 sec) followed by 30 cycles at 94°C (10 min) and 60°C (45 sec).


Sample preparation and analysis by MALDI-TOF MS. 

[0365] For precipitation (Stults et al., (1991) Rapid Commun. Mass Spectrom. 5:359-363) of both digests and PROBE products, 5 µl 3 M ammonium acetate (pH 6.5), 0.5 µl glycogen (10 mg/ml, Sigma), and 110 µl absolute ethanol were added to 50 µl of the analyte solutions and stored for 1 hour at room temperature. After 10 min centrifugation at 13,000 x g the pellet was washed in 70% ethanol and resuspended in 1 µl 18 Mohm/cm H2O. Where noted in the text, additional desalting was achieved by shaking 10-20 µL of ammonium saturated DOWEX (Fluka #44485) cation exchange beads in 40 µL of analyte. The beads, purchased in the protonated form, were pre-treated with three 5 min spin-decant steps in 2 M NH4OH, followed by H2O and 10 mM ammonium citrate.

[0366] 0.35 µL of resuspended DNA was mixed with 0.35-1.3 µL of matrix solution (Wu et al., (1993) Rapid Commun. Mass Spectrom. 7:142-146; 0.7 M 3-hydroxypicolinic acid (3-HPA), 0.07 M ammonium citrate in 1:1 H2O:CH3CN) on a stainless steel sample target disk and allowed to air dry preceding spectrum acquisition using a Thermo Bioanalysis Vision 2000 MALDI-TOF operated in reflectron mode with 5 and 20 kV on the target and conversion dynode, respectively. Theoretical average molecular masses (Mr(calc)) of the fragments were calculated from atomic compositions; the mass of a proton (1.08 Da) is subtracted from raw data values in reporting experimental molecular masses (Mr(exp)) on a neutral basis. An external calibration generated from eight peaks (3000-18000 Da) was applied to all spectra.

RESULTS 

Digestion with CfoI alone.

[0367] The inset to Figure 57a shows a 12% polyacrylamide gel electrophoretic separation of an e3/e3 genotype after digestion of the 252 bp apo E amplified product with CfoI. Comparison of the electrophoretic bands with a molecular weight ladder shows the cutting pattern to be mostly as expected (Table V) for the e3/e3 genotype. Differences are that the faint band at approximately 25 bp is not expected, and the smallest fragments are not observed. The accompanying mass spectrum of precipitated digest products shows a similar pattern, albeit at higher resolution. Comparison with Table V shows that the observed masses are consistent with those of single-stranded DNA; the combination
of an acidic matrix environment (3-HPA, pK^S) and the absorption of thermal energy via interactions with the 337 nm 
absorbing 3-HPA upon ionization is known to denature short stretches of dsDNA under normal MALDI conditions (Tang, K et al., (1994) Rapid Commun. Mass Spectrom. 8:183-186).

[0368] The approximately 25-mers, unresolved with electrophoresis, are resolved by MS as three single stranded fragments; while the largest (7427 Da) of these may represent a doubly charged ion from the 14.8 kDa fragments (m = 14850, z = 2; m/z = 7425), the 6715 and 7153 Da fragments could result from PCR artifacts or primer impurities; none of the three peaks is observed when amplified products are purified with Qiagen purification kits prior to digestion. The Table V 8871 Da 29-mer sense strand 3'-terminal fragment is not observed; the species detected at 9186 Da is consistent with the addition of an extra base (9187 - 8871 = 316, consistent with A) by the Taq polymerase during PCR amplification (Hu, G et al., (1993) DNA and Cell Biol. 12:763-770). The individual single strands of each double strand with <35 bases (11 kDa) are resolved as single peaks; the 48-base single strands (Mr(calc) 14845 and 14858), however, are observed as an unresolved single peak at 14850 Da. Separating these into single peaks would require a mass resolution (m/Δm, the ratio of the mass to the peak width at half height) of 14850/13 ≈ 1140, nearly an order of magnitude greater than what is routine with the standard reflectron time-of-flight instrumentation used in this study; resolving such small mass differences with high performance instrumentation such as Fourier transform MS, which provides up to three orders of magnitude higher resolution in this mass range, has been demonstrated. The 91-mer single strands (Mr(calc) 27849 and 28436) are also not resolved, even though this requires a resolution of only <50. The dramatic decrease in peak quality at higher masses is due to metastable fragmentation (i.e. depurination) resulting from excess internal energy absorbed during and subsequent to laser irradiation.
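The resolution figures in this paragraph can be reproduced with a few lines. The sketch below treats the resolution needed to separate two peaks as their mean mass divided by their mass difference, which is an assumption about how m/Δm was evaluated (the text uses the rounded value 14850 as the numerator), and it computes m/z by simple division, as in the doubly charged ion example.

```python
# Resolution needed to separate two peaks, and m/z of a multiply charged ion.

def required_resolution(mass_a, mass_b):
    """m / delta-m, using the mean of the two masses as m."""
    mean_mass = (mass_a + mass_b) / 2.0
    return mean_mass / abs(mass_a - mass_b)


def mz(mass, charge):
    """Mass-to-charge ratio by simple division (charge carriers ignored)."""
    return mass / charge


if __name__ == "__main__":
    # 48-base single strands from Table V.
    print(round(required_resolution(14845, 14858)))
    # 91-mer single strands from Table V.
    print(round(required_resolution(27849, 28436), 1))
    # Doubly charged ion of the unresolved 14.8 kDa peak.
    print(mz(14850, 2))
```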

Simultaneous digestion with CfoI and RsaI.

[0369] Figure 57b (inset) shows a 12% polyacrylamide gel electrophoresis separation of e3/e3 double digest products, with bands consistent with dsDNA with 24, 31, 36, 48, and 55 base pairs, but not for the smaller fragments. Although more peaks are generated (Table V) than with CfoI alone, the corresponding mass spectrum is more easily interpreted and reproducible since all fragments contain <60 bases, a size range far more appropriate for MALDI-MS if reasonably accurate mass values (e.g., 0.1%) are desired. For fragments in this mass range, the mass measuring accuracy using external calibration is ~0.1% (i.e. ≤ ±10 Da at 10 kDa). Significant depurination (indicated in the figure by an asterisk) is observed for all peaks above 10 kDa, but even the largest peak at 17171 Da is clearly resolved from its depurination peak so that an accurate mass can be measured. Although molar concentrations of digest products should be identical, some discrimination against those fragments with <11 bases is observed, probably due to their loss in the ethanol/glycogen precipitation step. The quality of MS results from simultaneous digestion with CfoI (or HhaI) and RsaI is superior to those with CfoI (or HhaI) alone, since the smaller fragments generated are better suited to higher mass accuracy measurements, and with all genotypes there is no possibility for dimer peaks overlapping with high mass diagnostic peaks. Since digestion by RsaI/CfoI and RsaI/HhaI produce the same restriction fragments, but the former may be performed as a simultaneous digest because the buffer requirements of the two enzymes are the same, this enzyme mixture was used for all subsequent genotyping by restriction digest protocols.


Table V

Mass and Copy Number of Expected Restriction Digest Products

Table Va. CfoI Digestion^a

(+) (-) mass (Da)    e2/e2   e2/e3   e2/e4   e3/e3   e3/e4   e4/e4
5781, 5999             0       0       1       0       1       2
10752, 10921           0       1       1       2       2       2
14845, 14858           0       1       1       2       2       2
22102, 22440           0       0       1       0       1       2
25575, 25763           2       1       1       0       0       0
27849, 28436           2       2       1       2       1       0

Table Vb. CfoI/RsaI Digestion^b

(+) (-) mass (Da)    e2/e2   e2/e3   e2/e4   e3/e3   e3/e4   e4/e4
3428, 4025             0       1       1       2       2       2
5283, 5880             0       0       1       0       1       2
5781, 5999             0       0       1       0       1       2
11279, 11627           2       2       1       2       1       0
14845, 14858           0       1       1       2       2       2
18269, 18848           2       1       1       0       0       0

^a CfoI invariant fragment masses: 1848, 2177, 2186, 2435, 4S24, 5004, 5412, 5750, 8871, 9628 Da.

^b CfoI/RsaI invariant fragment masses: 1848, 2177, 2186, 2436, 4S24, 5004, 5412, 5750, 6745, 7510, 8871, 9628, 16240, 17175 Da.


Table VI

Genotype   ddT Mr(calc)                     ddT Mr(exp)         ddC Mr(calc)                     ddC Mr(exp)
e2/e2      ^a5918, ^b6768                   ---                 ^a6536, ^b7387                   ---
e2/e3      ^a5918, ^b6768, ^b7965           5919, 6769, 7967    ^a6536, ^b6753, ^b7387           6542, 6752, 7393
e2/e4      ^a5918, ^b6768, ^b7965, ^a8970   ---                 ^a5903, ^a6536, ^b6753, ^b7387   ---
e3/e3      ^a5918, ^b7965                   5918, 7966          ^a6536, ^b6753                   6542, 6756
e3/e4      ^a5918, ^b7965, ^a8970           5914, 7959, 8965    ^a5903, ^a6536, ^b6753           5898, 6533, 6747
e4/e4      ^b7965, ^a8970                   7966, 8969          ^a5903, ^b6753                   5900, 6752

^a From codon 112 detection primer (unextended 5629.7 Da).
^b From codon 158 detection primer (unextended 6480.3 Da).
Dashed lines (---): this genotype not available from the analyzed pool of 100 patients.


[0370] Figure 58a-c shows the ApoE e3/e3 genotype after digestion with CfoI and a variety of precipitation schemes; equal volume aliquots of the same amplified product were used for each. The sample treated with a single precipitation (Figure 58a) from an ammonium acetate/ethanol/glycogen solution results in a mass spectrum characterized by broad peaks, especially at high mass. The masses for intense peaks at 5.4, 10.7, and 14.9 kDa are 26 Da (0.5%), 61 Da (0.6%), and 45 Da (0.3%) higher, respectively, than the expected values; the resolution (the ratio of the measured mass of a peak to its width at half its total intensity) for each of these is ~50, and decreases with increasing mass. Such observations are consistent with a high level of nonvolatile cation adduction; for the 10.8 kDa fragment, the observed mass shift is consistent with a greater than unit ratio of adducted:nonadducted molecular ions.
[0371] MS peaks from a sample redissolved and precipitated a second time are far sharper (Figure 58b), with resolution values nearly double those of the corresponding Figure 58a peaks. Mass accuracy values are also considerably improved; each is within 0.07% of its respective calculated value, close to the independently determined instrumental limits for DNA measurement using 3-HPA as a matrix. Single (not shown) and double (Figure 58c) precipitations with isopropyl alcohol (IPA) instead of ethanol result in resolution and mass accuracy values comparable to those for corresponding ethanol precipitations, but enhanced levels of dimerization are observed, again potentially confusing measurements when such dimers overlap with higher mass "diagnostic" monomers present in the solution. EtOH/ammonium acetate precipitation with glycogen as a nucleation agent results in nearly quantitative recovery of fragments except for the 7-mers, serving as a simultaneous concentration and desalting step prior to MS detection. Precipitation from the same EtOH/ammonium acetate solutions in the absence of glycogen results in far poorer recovery, especially at low mass.

[0372] The results indicate that, after either IPA or EtOH precipitation, a second precipitation is necessary to obtain accurate Mr(exp) values and to maintain high mass accuracy and resolution.

[0373] The ratio of matrix:digest product also affects spectral quality; severe suppression of higher mass fragments (not shown) observed with 1:1 volume matrix:digest product (redissolved in 1 µL) is alleviated by using a 3-5 fold volume excess of matrix.

[0374] Apo E genotyping by enzymatic digestion. Codon 112 and 158 polymorphisms fall within CfoI (but not RsaI) recognition sequences. In the 252 bp amplified product studied here, invariant (i.e. cut in all genotypes) sites cause cuts after bases 31, 47, 138, 156, 239, and 246. The cutting site after base 66 is only present for e4, while that after base 204 is present in e3 and e4; the e2 genotype is cut at neither of these sites. These differences in the restriction pattern can be demonstrated as variations in mass spectra. Figure 59 shows mass spectra from several ApoE genotypes available from a pool of 100 patients (Braun, A et al., (1992) Hum. Genet. 89:401-406). Vertical dashed lines are drawn through those masses corresponding to the expected Table V diagnostic fragments; other labeled fragments are invariant. Referring to Table V, note that a fragment is only considered "invariant" if it is present in duplicate copies for a given allele; to satisfy this requirement, such a fragment must be generated in each of the e2, e3, and e4 alleles.
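The allele-specific cutting pattern can be turned into expected fragment lengths directly. The sketch below assumes the cut positions listed in this paragraph are counted along one strand of the 252-base product, starting from its 5' end; the constant and function names are illustrative only.

```python
# Expected CfoI fragment lengths (in bases) per apo E allele, computed
# from the cut positions quoted in the text.
AMPLICON_LENGTH = 252
INVARIANT_CUTS = [31, 47, 138, 156, 239, 246]
ALLELE_SPECIFIC_CUTS = {
    "e2": [],          # cut at neither variable site
    "e3": [204],       # site after base 204 present
    "e4": [66, 204],   # both variable sites present
}


def fragment_lengths(allele):
    """Return fragment lengths in 5' to 3' order for the given allele."""
    cuts = sorted(INVARIANT_CUTS + ALLELE_SPECIFIC_CUTS[allele])
    bounds = [0] + cuts + [AMPLICON_LENGTH]
    return [b - a for a, b in zip(bounds, bounds[1:])]


if __name__ == "__main__":
    for allele in ("e2", "e3", "e4"):
        print(allele, fragment_lengths(allele))
```

The 91-base fragment appears only for e2 and e3, the 83-base fragment only for e2, and the 19- and 72-base fragments only for e4, which matches the grouping of the diagnostic masses in Table Va.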
[0375] The spectrum in Figure 59a contains all of the expected invariant fragments above 3 kDa, as well as diagnostic peaks at 3428 and 4021 (both weak), 11276 and 11627 (both intense), 14845, 18271, and 18865 Da. The spectrum in Figure 59b is nearly identical except that the pair of peaks at 18 kDa is not detected, and the relative peak intensities, most notably among the 11-18 kDa fragments, are different. The spectrum in Figure 59c also has no 18 kDa fragments, but instead has new low intensity peaks between 5-6 kDa. The intensity ratios for fragments above 9 kDa are similar to those of Figure 59b except for a relatively lower 11 kDa fragment pair. Figure 59d, which again contains the 5-6 kDa cluster of peaks, is the only spectrum with no 11 kDa fragments, and like the previous two also has no 18 kDa fragment.

[0376] Despite the myriad of peaks in each spectrum, each genotype can be identified by the presence and absence of only a few of the Table Vb diagnostic peaks. Due to the limited resolution of the MALDI-TOF instrumentation employed, the most difficult genotypes to differentiate are those based upon the presence or absence of the four diagnostic fragments between 5.2 and 6.0 kDa characteristic of the e4 allele, since these fragments nearly overlap with several invariant peaks. It has been found herein that the 5283 Da diagnostic fragment overlaps with a depurination peak from the 5412 Da invariant fragment, and the 5781 Da diagnostic peak is normally not completely resolved from the 5750 Da invariant fragment. Thus, distinguishing between an e2/e4 and an e2/e3 genotype, or between an e3/e4 and an e3/e3 genotype, relies upon the presence or absence of the 5880 and 5999 Da fragments. Each of these is present in Figures 59c and 59d, but not in 59a or 59b.

[0377] The genotype of each of the patients in Figure 59 can be more rapidly identified by reference to the flowchart in Figure 60. Consider the Figure 59a spectrum. The intense pair of peaks at 11 kDa discounts the possibility of homozygous e4, but does not differentiate between the other five genotypes. Likewise, the presence of the unresolved 14.8 kDa fragments is inconsistent with homozygous e2, but leaves four possibilities (e2/e3, e2/e4, e3/e3, e3/e4). Of these only e2/e3 and e2/e4 are consistent with the 18 kDa peaks; the lack of peaks at 5283, 5879, 5779, and 5998 Da indicates that the Figure 59a sample is e2/e3. Using the same procedure, the Figures 59b-d genotypes can be identified as e3/e3, e3/e4, and e4/e4, respectively. To date, all allele identifications by this method have been consistent with, and in many cases more easily interpreted than, those attained via conventional methods. The assignment can be further confirmed by assuring that fragment intensity ratios are consistent with the copy numbers of Table V. For instance, the 14.8 kDa fragments are of lower intensity than those at 16-17 kDa in Figure 59a, but the opposite is seen in Figures 59b-d. This is as expected, since in the latter three genotypes the 14.8 kDa fragments are present in duplicate, but the first is a heterozygote containing e2, so that half of the amplified products do not contribute to the 14.8 kDa signal. Likewise, comparison of the 11 kDa fragment intensity to those at 9.6 and 14.8 kDa indicates that this fragment is double, double, single, and zero copy in Figures 59a-d, respectively. These data confirm that MALDI can perform in a semi-quantitative way under these conditions.
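The presence/absence reasoning above can also be expressed as a lookup. The sketch below is not the Figure 60 flowchart; it is a hypothetical alternative that assigns to each allele the diagnostic CfoI/RsaI peaks it contributes (read from the copy numbers of Table Vb, using the 5880 and 5999 Da peaks for e4 because the text notes that their partners overlap with invariant peaks) and returns the genotype whose expected peak set equals the observed one. The 0.1% matching tolerance is an assumption.

```python
from itertools import combinations_with_replacement

# One representative diagnostic mass (Da) per fragment, grouped by the
# allele that produces it (derived from the Table Vb copy numbers).
ALLELE_PEAKS = {
    "e2": {11279, 18269},
    "e3": {3428, 11279, 14845},
    "e4": {3428, 5880, 5999, 14845},
}
NOMINAL_PEAKS = set().union(*ALLELE_PEAKS.values())


def label_peaks(observed_masses, tolerance=0.001):
    """Map observed masses onto nominal diagnostic peaks within a relative tolerance."""
    labels = set()
    for mass in observed_masses:
        for nominal in NOMINAL_PEAKS:
            if abs(mass - nominal) <= tolerance * nominal:
                labels.add(nominal)
    return labels


def call_genotype(diagnostic_peaks):
    """Return every genotype whose expected diagnostic peak set matches exactly."""
    observed = set(diagnostic_peaks)
    calls = []
    for a, b in combinations_with_replacement(sorted(ALLELE_PEAKS), 2):
        if ALLELE_PEAKS[a] | ALLELE_PEAKS[b] == observed:
            calls.append(f"{a}/{b}")
    return calls


if __name__ == "__main__":
    # Diagnostic peaks reported for the Figure 59a spectrum.
    fig_59a = [3428, 4021, 11276, 11627, 14845, 18271, 18865]
    print(call_genotype(label_peaks(fig_59a)))
```

Presence and absence alone separate all six genotypes here; the intensity comparison described in the text would be an additional consistency check that this sketch does not attempt.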

[0378] ApoE genotyping by Primer Oligo Base Extension (PROBE). The PROBE reaction was also tested as a means of simultaneous detection of the codon 112 and 158 polymorphisms. A detection primer is annealed to a single-stranded PCR-amplified template so that its 3' terminus is just downstream of the variable site. Extension of this primer by a DNA polymerase in the presence of three dNTPs and one ddXTP (that is not present as a dNTP) results in products whose length and mass depend upon the identity of the polymorphic base. Unlike standard Sanger type sequencing, in which a particular base-specific tube contains ~99% dXTP and ~1% ddXTP, the PROBE mixture contains 100% of a particular ddXTP combined with the other three dNTPs. Thus with PROBE a full stop of all detection primers is achieved after the first base complementary to the ddXTP is reached.

[0379] For the e2/e3 genotype, the PROBE reaction (mixture of ddTTP, dATP, dCTP, dGTP) causes a Mr(exp) shift of the codon 112 primer to 5919 Da, and of the codon 158 primer to 6769 and 7967 Da (Table VI); a pair of extension products results from the single codon 158 primer because the e2/e3 genotype is heterozygous at this position. Three extension products (one from codon 158, two from 112) are also observed from the heterozygote e3/e4 (Figure 61c and Table VI), while only two products (one from each primer) are observed from the Figure 61b (e3/e3) and Figure 61d (e4/e4) homozygote alleles. Referring to Table VI, each of the available alleles results in all expected ddT reaction product masses within 0.1% of the theoretical mass, and thus each is unambiguously characterized by this data alone. Further confirmation of the allele identities may be obtained by repeating the reaction with ddCTP (plus dATP, dTTP, dGTP); these results, summarized also in Table VI, unambiguously confirm the ddT results.
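The calculated masses in Table VI can be approximated from average nucleotide residue masses. The sketch below is an illustration under stated assumptions: the residue masses are standard average values for DNA, the detection primer sequences are taken as GCGGACATGGAGGACGTG (codon 112) and GATGCCGATGACCTGCAGAAG (codon 158), which reproduce the unextended masses of 5629.7 and 6480.3 Da quoted with Table VI, and the four-base extension "CGCC" used in the last example is assumed from the published apo E sequence rather than stated in the text.

```python
# Average masses (Da) of nucleotide residues within a DNA chain.
RESIDUE_MASS = {"A": 313.21, "C": 289.18, "G": 329.21, "T": 304.20}
TERMINAL_CORRECTION = -61.96  # 5'-OH / 3'-OH ends: minus HPO3, plus H2O
DIDEOXY_CORRECTION = -16.00   # a ddNMP residue lacks the 3' oxygen

PRIMER_112 = "GCGGACATGGAGGACGTG"     # assumed reading, see lead-in
PRIMER_158 = "GATGCCGATGACCTGCAGAAG"  # assumed reading, see lead-in


def oligo_mass(seq):
    """Average mass of a linear, unphosphorylated oligodeoxynucleotide."""
    return sum(RESIDUE_MASS[base] for base in seq.upper()) + TERMINAL_CORRECTION


def probe_product_mass(primer, extension, terminator):
    """Mass of primer + incorporated dNMPs (extension) + one terminating ddNMP."""
    return oligo_mass(primer + extension + terminator) + DIDEOXY_CORRECTION


if __name__ == "__main__":
    print(round(oligo_mass(PRIMER_112), 1), round(oligo_mass(PRIMER_158), 1))
    # Immediate termination by ddT or ddC at the variable position.
    print(round(probe_product_mass(PRIMER_112, "", "T")))
    print(round(probe_product_mass(PRIMER_112, "", "C")))
    print(round(probe_product_mass(PRIMER_158, "", "T")))
    print(round(probe_product_mass(PRIMER_158, "", "C")))
    # Longer ddT product from the codon 158 primer (assumed extension CGCC).
    print(round(probe_product_mass(PRIMER_158, "CGCC", "T")))
```

Products that extend past the variable base need the downstream template sequence, which is why the extension is passed in explicitly instead of being derived here.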

[0380] Appropriateness of the methods. Comparison of Figures 59 (restriction digestion) and 61 (PROBE) indicates that the PROBE method provides far more easily interpreted spectra for the multiplex analysis of codon 112 and 158 polymorphisms than does the restriction digest analysis. While the digests generate up to ~25 peaks per mass spectrum and in some cases diagnostic fragments overlapping with invariant fragments, the PROBE reaction generates a maximum of only two peaks per detection primer (i.e. polymorphism). Automated peak detection, spectrum analysis, and allele identification would clearly be far more straightforward for the latter. Spectra for highly multiplexed PROBE, in which several polymorphic sites from the same or different amplified products are measured from one tube, are also potentially simple to analyze. Underscoring its flexibility, PROBE data analysis can be further simplified by judicious a priori choice of primer lengths, which can be designed so that no primers or products can overlap in mass.
[0381] Thus while PROBE is the method of choice for large scale clinical testing of previously well characterized polymorphic sites, the restriction digest analysis as described here is ideally suited to screening for new mutations. The identity of each of the two polymorphisms discussed in this study affects the fragment pattern; if this is the only information used, then the MS detection is a faster alternative to conventional electrophoretic separation of restriction fragment length polymorphism products. The exact measurement of fragment mass values can also give information about sites completely remote from the enzyme recognition site, since other single point mutations necessarily alter the mass of each of the single strands of the double stranded fragment containing the mutation. The 252 bp amplified product could also contain allelic variants resulting in, for example, previously described Gly127Asp (Weisgraber, KH et al., (1984) J. Clin. Invest. 73:1024-1033), Arg136Ser (Wardell, MR et al., (1987) J. Clin. Invest. 80:483-490), Arg142Cys (Horie, Y et al., (1992) J. Biol. Chem. 267:1962-1968), Arg145Cys (Rall SC Jr et al., (1982) Proc. Natl. Acad. Sci. U.S.A. 79:4696-4700), Lys146Glu (Mann, WA et al., (1995) J. Clin. Invest. 96:1100-1107), or Lys146Gln (Smit, M et al., (1990) J. Lipid Res. 31:45-53) substitutions. The G→A base substitution which codes for the Gly127Asp amino acid substitution would result in a -16 Da shift in the sense strand, and in a +15 Da (C→T) shift in the antisense strand, but not in a change in the restriction pattern. Such a minor change would be virtually invisible by electrophoresis; however, with accurate mass determination the substitution could be detected; the invariant 55-mer fragment at 16240 (sense) and 17176 Da (antisense) would shift to 16224 and 17190 Da, respectively. Obtaining the mass accuracy required to detect such minor mass shifts using current MALDI-TOF instrumentation, even with internal calibration, is not routine since minor unresolved adducts and/or poorly defined peaks limit the ability for accurate mass calling. With high performance electrospray ionization Fourier transform MS (ESI-FTMS), single Da accuracy has been achieved with synthetic oligonucleotides (Little, DP et al., (1995) Proc. Natl. Acad. Sci. U.S.A. 92:2318-2322) up to 100-mers (Little, DP et al., (1994) J. Am. Chem. Soc. 116:4893-4897), and similar results have recently been achieved with up to 25-mers using MALDI-FTMS (Li, Y et al., (1996) Anal. Chem. 68:2090-2096).
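The strand-specific shifts quoted for the G to A substitution can be checked from average residue masses. The following sketch uses standard average masses of DNA nucleotide residues (assumed values, not taken from the text) and reports the shift on the sense strand together with the shift on the antisense strand, where the complementary bases are exchanged.

```python
# Mass shift of a single-base substitution on each strand of a duplex.
RESIDUE_MASS = {"A": 313.21, "C": 289.18, "G": 329.21, "T": 304.20}
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def substitution_shifts(ref_base, alt_base):
    """Return (sense shift, antisense shift) in Da for ref -> alt on the sense strand."""
    sense = RESIDUE_MASS[alt_base] - RESIDUE_MASS[ref_base]
    antisense = RESIDUE_MASS[COMPLEMENT[alt_base]] - RESIDUE_MASS[COMPLEMENT[ref_base]]
    return sense, antisense


if __name__ == "__main__":
    sense, antisense = substitution_shifts("G", "A")
    print(f"sense {sense:+.1f} Da, antisense {antisense:+.1f} Da")
    # Applied to the invariant 55-mer masses quoted in the text.
    print(round(16240 + sense), round(17176 + antisense))
```

The same function gives roughly 9 Da in opposite directions on the two strands for an A to T exchange, the smallest shift among the possible single-base substitutions.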

EXAMPLE 13

A Method for Mass Spectrometric Detection of DNA Fragments Associated With Telomerase Activity 

INTRODUCTION

[0382] One-fourth of all deaths in the United States are due to malignant tumors (R.K. Jain, (1996) Science 271:1079-1080). For diagnostic and therapeutic purposes there is a high interest in reliable and sensitive methods of tumor cell detection.

[0383] Malignant cells can be distinguished from normal cells by different properties. One of those is the immortalization of malignant cells, which enables uncontrolled cell proliferation. Normal diploid mammalian cells undergo a finite number of population doublings in culture before they undergo senescence. It is supposed that the number of population doublings is related to the shortening of chromosome ends, called telomeres, in every cell division. The reason for said shortening is based on the properties of the conventional semiconservative replication machinery. DNA polymerases only work in 5' to 3' direction and need an RNA primer.

[0384] Immortalization is thought to be associated with the expression of active telomerase. Said telomerase is a ribonucleoprotein catalyzing repetitive elongation of templates. This activity can be detected in a native protein extract of telomerase containing cells by a special PCR-system (N.W. Kim et al. (1994) Science 266:2011-2015) known as telomeric repeat amplification protocol (TRAP). The assay, as used herein, is based on the telomerase specific extension of a substrate primer (TS) and a subsequent amplification of the telomerase specific extension products by a PCR step using a second primer (bioCX) complementary to the repeat structure. The characteristic ladder fragments of those assays are conventionally detected by the use of gel electrophoretic and labeling or staining systems. These methods can be replaced by MALDI-TOF mass spectrometry, leading to faster, accurate, and automated detection.


MATERIALS AND METHODS 

Preparation of cells 

[0385] 1 X IQS cultured telomerase-positive cells were pelleted and washed once with PBS (137 mM NaCl, 2.7 mM KCl, 4.3 mM Na2HPO4·7H2O, 1.4 mM KH2PO4 in sterile DEPC water). The prepared cells may be stored at -75°C. Tissue samples have to be homogenized, according to procedures well known in the art, before extraction.

Telomerase extraction 


[0386] The pellet was resuspended in 200 µl CHAPS lysis buffer (10 mM Tris-HCl pH 7.5, 1 mM MgCl2, 1 mM EGTA, 0.1 mM benzamidine, 5 mM β-mercaptoethanol, 0.5% CHAPS, 10% glycerol) and incubated on ice for 30 min. The sample was centrifuged at 12,000 g for 30 min at 4°C. The supernatant was transferred into a fresh tube and stored at -75°C until use.


TRAP-assay 

[0387] 2 µl of telomerase extract were added to a mixture of 10X TRAP buffer (200 mM Tris-HCl pH 8.3, 15 mM MgCl2, 630 mM KCl, 0.05% Tween 20, 10 mM EGTA), 50x dNTP-mix (2.5 mM each dATP, dTTP, dGTP, and dCTP), 10 pmol of TS primer and 50 pmol of bioCX primer in a final volume of 50 µl. The mixture was incubated at 30°C for 10 minutes and at 94°C for 5 min; 2 units of Taq polymerase were then added and a PCR was performed with 30 cycles of 94°C for 30 seconds, 50°C for 30 seconds and 72°C for 45 seconds.

Purification of TRAP-assay products 


[0388] For every TRAP-assay to be purified, 50 µl Streptavidin M-280 Dynabeads (10 mg/ml) were washed twice with 1x BW buffer (5 mM Tris-HCl, pH 7.5, 0.5 mM EDTA, 1 M NaCl). 50 µl of 2x BW buffer were added to the PCR mix and the beads were resuspended in this mixture. The beads were incubated under gentle shaking for 15 min at ambient temperature. The supernatant was removed and the beads were washed twice with 1x BW buffer. To the beads 50 µl 25% ammonium hydroxide were added and incubated at 60°C for 10 min. The supernatant was saved, the procedure repeated, both supernatants were pooled and 300 µl ethanol (100%) were added. After 30 min the DNA was pelleted at 13,000 rpm for 12 min, the pellet was air-dried and resuspended in 600 nl ultrapure water.
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MALDI-TOF MS of TRAP-assay products 

[0389] 300 nl sample were mixed with 500 nl of saturated matrix-solution (3-HPA:ammonium citrate = 10:1 molar
ratio in 50% aqueous acetonitrile), dried at ambient temperature and introduced into the mass spectrometer (Vision
2000, Finnigan MAT). All spectra were collected in reflector mode using external calibration.

Sequences and masses 

[0390]

bioCX: d(bio-CCC TTA CCC TTA CCC TTA CCC TAA SEQ ID NO. 45), mass: 7540 Da.
TS: d(AAT CCG TGC AGC AGA GTT SEQ ID NO. 46), mass: 5523 Da.
Telomeric-repeat structure: (TTAGGG)n, mass of one repeat: 1909.2 Da.
Amplification products:
TS elongated by three telomeric repeats (first amplification product): 12452 Da. (N3)
TS elongated by four telomeric repeats: 14361 Da. (N4)
TS elongated by seven telomeric repeats: 20088 Da. (N7)
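
The listed masses can be cross-checked with a short calculation. The sketch below is illustrative only: the average residue masses and the end-group correction are commonly used textbook values rather than figures from this document, the function names are hypothetical, and the biotin label on bioCX is not modelled. The ladder is anchored on the N3 mass given above.

```python
# Average masses (Da) of nucleotide residues within a DNA strand.
# Assumed standard values, not taken from the source document.
RESIDUE = {"A": 313.21, "C": 289.18, "G": 329.21, "T": 304.20}
END_CORRECTION = -61.96  # strand with 5'-OH and 3'-OH termini

def oligo_mass(seq: str) -> float:
    """Average mass of an unmodified single-stranded DNA oligonucleotide."""
    return sum(RESIDUE[b] for b in seq) + END_CORRECTION

def repeat_mass(unit: str = "TTAGGG") -> float:
    """Mass added by one internal repeat unit (no end correction)."""
    return sum(RESIDUE[b] for b in unit)

def ladder_mass(n_repeats: int, first_mass: float = 12452.0, first_n: int = 3) -> float:
    """Mass of the product carrying n_repeats, anchored on the listed N3 mass."""
    return first_mass + (n_repeats - first_n) * repeat_mass()

ts = oligo_mass("AATCCGTGCAGCAGAGTT")  # TS primer sequence from the list
print("TS:", round(ts, 1), "repeat:", round(repeat_mass(), 1))
for n in (3, 4, 7):
    plain = ladder_mass(n)
    plus_a = plain + RESIDUE["A"]  # one extra A residue added by the polymerase
    print(f"N{n}:", round(plain), "with extra A:", round(plus_a))
```

With these assumed residue masses the TS primer and the repeat unit come out within 1 Da of the listed 5523 Da and 1909.2 Da.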

RESULTS 

[0391] Figure 62 depicts a section of a TRAP-assay MALDI-TOF mass spectrum. Assigned are the primers TS and
bioCX at 5497 and 7537 Da, respectively (calculated 5523 and 7540 Da). The signal marked by an asterisk represents
the n-1 primer product of chemical DNA synthesis. The first telomerase-specific TRAP-assay product is assigned at
12775 Da. This product represents a 40-mer containing three telomeric repeats. Due to primer sequences this is the
first expected amplification product of a positive TRAP-assay. The product is elongated by an additional nucleotide due
to extendase activity of Taq DNA polymerase (calculated non-extended product: 12452 Da, A-extended product: 12765
Da). The signal at 6389 Da represents the doubly charged ion of this product (calculated: 6387 Da). Figure 63 shows
a section of higher masses of the same spectrum as depicted in figure 62; the signal at 12775 Da is therefore identical
to that in figure 62. The TRAP-assay product containing seven telomeric repeats, representing a 64-mer also elongated
by an additional nucleotide, is detected at 20322 Da (calculated: 20395 Da). The signals marked 1, 2, 3 and 4 cannot
be base-line resolved. This region includes: 1. the signal of the dimeric n-1 primer, 2. the second TRAP-assay
amplification product, containing 4 telomeric repeats and therefore representing a 46-mer (calculated: 14361 Da/14674
Da for the extendase-elongated product) and 3. the dimeric primer-ion and furthermore all their corresponding
depurination signals. There is a gap observed between the signals of the second and fifth extension product. This
signal gap corresponds to the reduced band intensities observed in some cases for the third and fourth extension
product in autoradiographic analysis of TRAP-assays (N.W. Kim et al. (1994) Science 266:2013).

[0392] The above-mentioned problems, caused by the dimeric primer and related signals, can be overcome using
an ultrafiltration step employing a molecular weight cut-off membrane for primer removal prior to MALDI-TOF-MS
analysis. This will permit an unambiguous assignment of the second amplification product.

EXAMPLE 14

A Method for Detecting Neuroblastoma-Specific Nested RT-Amplified Products Via MALDI-TOF Mass
Spectrometry

Introduction

[0393] Neuroblastoma is predominantly a tumor of early childhood with 66% of the cases presenting in children
younger than 5 years of age. The most common symptoms are those due to tumor mass, bone pain, or those caused
by excessive catecholamine secretion. In rare cases, neuroblastoma can be identified prenatally (R.W. Jennings et al.,
(1993) J. Ped. Surgery 28:1168-1174). Approximately 70% of all patients with neuroblastoma have metastatic disease
at diagnosis. The prognosis is dependent on age at diagnosis, clinical stage and other parameters.
[0394] For diagnostic purposes there is a high interest in reliable and sensitive methods of tumor cell detection,
e.g., in control of autologous bone marrow transplants or on-going therapy.

[0395] Since catecholamine synthesis is a characteristic property of neuroblastoma cells and bone marrow cells lack
this activity (H. Naito et al., (1991) Eur. J. Cancer 27:762-765), neuroblastoma cells or metastasis in bone marrow can
be identified by detection of human tyrosine 3-hydroxylase (E.C. 1.14.16.2, hTH) which catalyzes the first step in
biosynthesis of catecholamines.

[0396] The expression of hTH can be detected via reverse transcription (RT) polymerase chain reaction (PCR) and
the amplified product can be analyzed via MALDI-TOF mass spectrometry.

Materials and methods 

Cell- or tissue-treatment

[0397] Cultured cells were pelleted (10 min. 8000 rpm) and washed twice with PBS (137 mM NaCl, 2.7 mM KCl, 4.3
mM Na2HPO4·7H2O, 1.4 mM KH2PO4 in sterile DEPC water). The pellet was resuspended in 1 ml lysis/binding buffer
(100 mM Tris-HCl, pH 8.0, 500 mM LiCl, 10 mM EDTA, 1% Li-dodecyl sulfate, 5 mM DTT) until the solution became
viscous. Viscosity was reduced by a DNA-shear step using a 1 ml syringe. The lysate may be stored at -75°C or
processed further directly. Solid tissues (e.g., patient samples) have to be homogenized before lysis.

Preparation of magnetic Oligo-dT(25) beads

[0398] 100 µL beads per 1 x 10^6 cells were separated from the storage buffer and washed twice with 200 µL
lysis/binding buffer.

Isolation of poly A+ RNA

[0399] The cell lysate was added to the prepared beads and incubated for 5 min. at ambient temperature. The beads
were separated magnetically for 2-5 min. and washed twice with 0.5 ml LDS (10 mM Tris-HCl, pH 8.0, 0.15 M LiCl, 1
mM EDTA, 0.1% LiDS).

Solid-phase first-strand cDNA synthesis 

[0400] The poly A+ RNA containing beads were resuspended in 20 µL of reverse transcription mix (50 mM Tris-HCl,
pH 8.3, 8 mM MgCl2, 30 mM KCl, 10 mM DTT, 1.7 mM dNTPs, 3 U AMV reverse transcriptase) and incubated for 1
hour at 45°C (with a resuspension step every ten min.). The beads were separated from the reverse transcription mix,
resuspended in 50 µL of elution buffer (2 mM EDTA pH 8.0) and heated to 95°C for 1 min. for elution of the RNA. The
beads with the cDNA first-strand can be stored in TB (0.089 M Tris-base, 0.089 M boric acid, 0.2 mM EDTA pH 8.0),
TE (10 mM Tris-HCl, 0.1 mM EDTA, pH 8.0) or 70% ethanol for further processing.

Nested polymerase chain reaction 

[0401] Beads containing cDNA first-strand were washed twice with 1x PCR buffer (20 mM Tris-HCl pH 8.75, 10 mM
KCl, 10 mM (NH4)2SO4, 2 mM MgSO4, 0.1% Triton X-100, 0.1 mg bovine serum albumin) and resuspended in PCR
mix (containing 100 pmol of each outer primer, 2.5 U Pfu(exo-) DNA polymerase, 200 µM of each dNTP and PCR
buffer in a final volume of 50 µL). The mixture was incubated at 72°C for 1 min. and amplified by PCR for 30 cycles.
For the nested reaction, 1 µL of the first PCR was added as template to a PCR mix (as above but with nested primers
instead of outer primers) and subjected to the following temperature program: 94°C 1 min., 65°C 1 min. and 72°C
1 min. for 20 cycles.

Purification of nested amplified products 

[0402] Primers and low-molecular reaction by-products were removed using a 10,000 Da cut-off ultrafiltration unit.
Ultrafiltration was performed at 7,500 g for 25 minutes. For every PCR to be purified, 50 µL Streptavidin M-280
Dynabeads (10 mg/ml) were washed twice with 1x BW buffer (5 mM Tris-HCl, pH 7.5, 0.5 mM EDTA, 1 M NaCl), added
to the ultrafiltration membrane and incubated under gentle shaking for 15 min. at ambient temperature. The supernatant
was removed and the beads were washed twice with 1x BW buffer. 50 µL 25% ammonium hydroxide were added to the
beads and incubated at ambient temperature for 10 min. The supernatant was saved, the procedure repeated, both
supernatants were pooled and 300 µL ethanol (100%) were added. After 30 min. the DNA was pelleted at 13,000 rpm
for 12 min., the pellet was air-dried and resuspended in 600 nl ultrapure water.

MALDI-TOF MS of nested amplified products 

[0403] 300 nl sample was mixed with 500 nl of saturated matrix-solution (3-HPA:ammonium citrate = 10:1 molar
ratio in 50% aqueous acetonitrile), dried at ambient temperature and introduced into the mass spectrometer (Vision
2000, Finnigan MAT). All spectra were collected in reflector mode using external calibration.

Outer primers:
[0404]

hTH1: d(TGT CAG AGC TGG ACA AGT GT SEQ ID NO:47)
hTH2: d(GAT ATT GTC TTC CCG GTA GC SEQ ID NO:48)

Nested primers:

bio-hTH: d(bio-CTC GGA CCA GGT GTA CCG CC SEQ ID NO:49), mass: 6485 Da
hTH6: d(CCT GTA CTG GAA GGC GAT CTC SEQ ID NO:50), mass: 6422.21 Da

mass of biotinylated single strand amplified product: 19253.6 Da
mass of nonbiotinylated single strand amplified product: 18758.2 Da

Results 

[0405] A MALDI-TOF mass spectrum of a human tyrosine 3-hydroxylase (hTH) specific nested amplified product
(61-mer) is depicted in figure 64. The signal at 18763 Da corresponds to the non-biotinylated strand of the amplified
product (calculated: 18758.2 Da, mass error: 0.02 Da). The signals below 10,000 and above 35,000 Da are due to
multiply charged and dimeric amplified product-ions, respectively.

[0406] The product was obtained from a solid phase cDNA derived in a reverse transcription reaction from 1 x 10^6
cells of a neuroblastoma cell-line (L-A-N-1) as described above. The cDNA first-strand was subjected to a first PCR
using outer primers (hTH1 and hTH2); an aliquot of this PCR was used as template in a second PCR using nested
primers (bio-hTH and hTH6). The nested amplified product was purified and analyzed by MALDI-TOF MS.
[0407] The spectrum in Fig. 64 demonstrates the possibility of neuroblastoma cell detection using nested RT-PCR
and MALDI-TOF MS analysis.

EXAMPLE 15

Rapid Detection of the RET Proto-oncogene Codon 634 Mutation Using Mass Spectrometry

Material and Methods

Probe 

[0408] The identity of codon 634 in each of the three alleles was confirmed by RsaI enzymatic digestion, single strand
conformational polymorphism or Sanger sequencing. Exon 11 of the RET gene was PCR amplified (40 cycles) from
genomic DNA using Taq-Polymerase (Boehringer-Mannheim) with 8 pmol each of 5'-biotinylated forward (5'-biotin-
CAT GAG GCA GAG CAT ACG CA-3' SEQ ID NO:51) and unmodified reverse (5'-GAC AGC AGC ACC GAG ACG
AT-3' SEQ ID NO:52) primer per tube; amplified products were purified using the Qiagen "QIAquick" kit to remove
unincorporated primers. 15 µl of amplified product were immobilized on 10 µL (10 mg/mL) Dynal streptavidin coated
magnetic beads, denatured using the manufacturer's protocol, and the supernatant containing the antisense strand
discarded. The PROBE reaction was performed using ThermoSequenase (TS) DNA Polymerase (Amersham) and
Pharmacia dNTP/ddNTPs. 8 pmol of extension primer (5'-CGG CTG CGA TCA CCG TGC GG-3' SEQ ID NO:53) was added
to 13 µL H2O, 2 µL TS-buffer, 2 µL 2 mM ddATP (or ddTTP), and 2 µl of 0.5 mM dGTP/dCTP/dTTP (or dGTP/dCTP/
dATP), and the mixture heated for 30 sec @ 94°C, followed by 30 cycles of 10 sec @ 94°C and 45 sec @ 50°C; after
a 5 min. incubation @ 95°C, the supernatant was decanted, and products were desalted by ethanol precipitation with
the addition of 0.5 µL of 10 mg/mL glycogen. The resulting pellet was washed in 70% ethanol, air dried, and suspended
in 1 µL H2O. 300 nL of this was mixed with the MALDI matrix (0.7 M 3-hydroxypicolinic acid, 0.07 M ammonium citrate
in 1:1 H2O:CH3CN) on a stainless steel sample probe and air dried. Mass spectra were collected on a Thermo
Bioanalysis Vision 2000 MALDI-TOF operated in reflectron mode with 5 and 20 kV on the target and conversion dynode,
respectively. Experimental masses (Mr(exp)) reported are those of the neutral molecules as measured using external
calibration.

Direct Measurement of Diagnostic Products 

[0409] PCR amplification conditions for a 44 bp region containing codon 634 were the same as above but using
Pfu polymerase; the forward primer contained a ribonucleotide at its 3'-terminus (forward, 5'-GAT CCA CTG TGC GAC
GAG C (SEQ ID NO:54)-ribo; reverse, 5'-GCG GCT GCG ATC ACC GTG C (SEQ ID NO:55)). After product
immobilization and washing, 80 µL of 12.5% NH4OH was added and heated at 80°C overnight to cleave the primer from
the 44-mer (sense strand) to give a 25-mer. Supernatant was pipetted off while still hot, dried, resuspended in 50 µL
H2O, precipitated, resuspended, and measured by MALDI-TOF as above. MALDI-FTMS spectra of 25-mer synthetic
analogs were collected as previously described (Li, Y. et al., (1996) Anal. Chem. 68:2090-2096); briefly, 1-10 pmol DNA
was mixed 1:1 with matrix on a direct insertion probe, admitted into the external ion source (positive ion mode), ionized
upon irradiation with a 337 nm wavelength laser pulse, and transferred via rf-only quadrupole rods into a 6.5 Tesla
magnetic field where they were trapped collisionally. After a 15 second delay, ions were excited by a broadband chirp
pulse and detected using 256K data points, resulting in time domain signals of 5 s duration. Reported (neutral) masses
are those of the most abundant isotope peak after subtracting the mass of the charge carrying proton (1.01 Da).

Results 

[0410] The first scheme presented utilizes the PROBE reaction shown schematically in Figure 65. A 20-mer primer
is designed to bind specifically to a region on the complementary template downstream of the mutation site; upon
annealing to the template, which is labelled with biotin and immobilized to streptavidin coated magnetic beads, the
PROBE primer is presented with a mixture of the three deoxynucleotide triphosphates (dNTPs), a di-dNTP (ddNTP),
and a DNA polymerase (Figure 65). The primer is extended by a series of bases specific to the identity of the variable
base in codon 634; for any reaction mixture (e.g., ddA+dT+dC+dG), three possible extension products representing
the three alleles are possible (Figure 65).
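
The termination logic of the PROBE reaction can be sketched in a few lines. This is a minimal illustration, not the authors' procedure: the function names are hypothetical, the residue masses are assumed standard average values, and the two-base extension sequences are invented placeholders chosen only so that the wild type yields a C-ddA product. The primer mass of 6135 Da is the value reported in the results.

```python
# Assumed average residue masses (Da); a dideoxy residue lacks one oxygen.
DNMP = {"A": 313.21, "C": 289.18, "G": 329.21, "T": 304.20}
DDNMP = {base: mass - 16.00 for base, mass in DNMP.items()}

def probe_extension(downstream: str, terminator: str) -> str:
    """Bases added to the primer. `downstream` is the sequence of the newly
    synthesized strand; copying stops at the first terminator base, which is
    incorporated as a dideoxynucleotide and ends the strand."""
    added = []
    for base in downstream:
        added.append(base)
        if base == terminator:
            break
    return "".join(added)

def product_mass(primer_mass: float, added: str, terminator: str) -> float:
    """Primer mass plus the added residues; the last residue is counted as a
    dideoxy residue when extension stopped at the terminator."""
    if added and added[-1] == terminator:
        return primer_mass + sum(DNMP[b] for b in added[:-1]) + DDNMP[terminator]
    return primer_mass + sum(DNMP[b] for b in added)

PRIMER_MASS = 6135.0  # unextended primer, as reported in the results
# Hypothetical extension sequences: variable base first, then an A.
alleles = {"wildtype": "CA", "C->T": "TA", "C->A": "AA"}
for name, downstream in alleles.items():
    added = probe_extension(downstream, "A")
    print(name, added, round(product_mass(PRIMER_MASS, added, "A")))
```

With ddA as terminator, the three placeholder alleles give products near 6721, 6736 and 6432 Da, so each allele maps to a distinct mass.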

[0411] For the negative control (Figure 66), the PROBE reaction with ddATP+dNTPs (N = T, C, G) causes a Mr(exp)
shift of the primer from 6135 to 6726 Da (Δm +591). The absence of a peak at 6432 rules out a G→A mutation (Figure
65); the mass of the single observed peak is more consistent with extension by C-ddA (Mr(calc) 6721, +0.07% error)
than by T-ddA (Mr(calc) 6736, -0.15% error) than of AgTG2G expected for C→A mutant. Combining the ddA and ddT
reaction data, it is clear that the negative control is as expected homozygous normal at codon 634.
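
The percentage errors quoted for the negative control follow from a simple relative-error formula. The sketch below reproduces them and picks the closer candidate; the function names are hypothetical.

```python
def percent_error(m_exp: float, m_calc: float) -> float:
    """Signed relative mass error in percent."""
    return (m_exp - m_calc) / m_calc * 100.0

def best_match(m_exp: float, candidates: dict[str, float]) -> str:
    """Name of the candidate whose calculated mass is closest to m_exp."""
    return min(candidates, key=lambda name: abs(m_exp - candidates[name]))

observed = 6726.0                                # Mr(exp) of the negative control
candidates = {"C-ddA": 6721.0, "T-ddA": 6736.0}  # Mr(calc) values from the text
for name, m_calc in candidates.items():
    print(name, f"{percent_error(observed, m_calc):+.2f}%")  # +0.07% and -0.15%
print(best_match(observed, candidates))                      # C-ddA
```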
[0412] The ddA reaction for patient 1 also results in a single peak (Mr(exp) = 6731) between expected values for
wildtype and C→T mutation (Figure 66b). The ddT reaction, however, results in two clearly resolved peaks consistent
with a heterozygote wildtype (Mr(exp) 8249, +0.04% mass error)/C→T mutant (Mr(exp) 6428 Da, +0.08% mass error).
For patient 2, the pair of Figure 66c ddA products represent a heterozygote C→A (Mr(exp) 6431, -0.06% mass error)/
normal (Mr(exp) 6719, -0.03% mass error) allele. The ddT reaction confirms this, with a single peak measured at 8264
Da consistent with unresolved wildtype and C→A alleles. The value of duplicate experiments is seen by comparing
Figures 66a and 66b; while for patient 1 the peak at 6726 from the ddA reaction represents only one species, the similar
peak from patient 1 is actually a pair of unresolved peaks differing in mass by 15 Da.

[0413] An alternate scheme for point mutation detection is differentiation of alleles by direct measurement of
diagnostic product masses. A 44-mer containing the RET634 site was generated by the PCR, and the 19-mer sense primer
removed by NH4OH cleavage at a ribonucleotide at its 3' terminus.

[0414] Figure 67 shows a series of MALDI-FTMS spectra of synthetic analogs of short amplified products containing
the RET634 mutant site. Figures 67a-c and 67d-f are homozygous and heterozygous genotypes, respectively. An
internal calibration was done using the most abundant isotope peak for the wildtype allele; application of this (external)
calibration to the five other spectra resulted in better than 20 ppm mass accuracy for each. Differentiation by mass
alone of the alleles is straightforward, even for heterozygote mixtures whose components differ by 16.00 (Figure 67d),
25.01 (Figure 67e), or 9.01 Da (Figure 67f). The value of high performance MS is clear when recognition of small DNA
mass shifts is the basis for diagnosis of the presence or absence of a mutation. The recent reintroduction of delayed
extraction (DE) techniques has improved the performance of MALDI-TOF with short DNAs (Roskey, M.T. et al., (1996)
Anal. Chem. 68:941-946); a resolving power (RP) of >10^3 has been reported for a mixed-base 50-mer, and a pair of
31-mers with a C or a T (Δm 15 Da) at a variable position resolved nearly to baseline. Thus DE-TOF-MS has
demonstrated the RP required for separation of the individual components of heterozygotes. Even with DE, however, the
precision of DNA mass measurement with TOF is typically 0.1% (8 Da at 8 kDa) using external calibration, sufficiently
high to result in incorrect diagnoses. Despite the possibility of space charge induced frequency shifts (Marshall, A.G.
et al. (1991) Anal. Chem. 63:215A-229A), MALDI-FTMS mass errors are rarely as high as 0.005% (0.4 Da at 8 kDa),
making internal calibration unnecessary.
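
The contrast between the two instruments can be made concrete by converting the relative precision into Daltons and comparing it with the allele mass differences. The sketch below uses an assumed nearest-mass criterion that is not stated in the source: an assignment is treated as unambiguous only if the mass uncertainty is smaller than half the spacing between the two candidate masses. The function names are hypothetical.

```python
def mass_uncertainty_da(mass_da: float, relative_error_percent: float) -> float:
    """Absolute mass uncertainty (Da) for a given relative error in percent."""
    return mass_da * relative_error_percent / 100.0

def unambiguous(delta_da: float, mass_da: float, relative_error_percent: float) -> bool:
    """Assumed criterion: the uncertainty must stay below half the spacing
    between two candidate masses for a nearest-mass assignment to be safe."""
    return mass_uncertainty_da(mass_da, relative_error_percent) < delta_da / 2.0

MASS = 8000.0                    # Da, the 8 kDa example used in the text
DELTAS = (9.01, 16.00, 25.01)    # allele mass differences from Figure 67

for label, rel in (("TOF, external calibration", 0.1), ("FTMS", 0.005)):
    tol = mass_uncertainty_da(MASS, rel)
    verdicts = {d: unambiguous(d, MASS, rel) for d in DELTAS}
    print(label, round(tol, 2), verdicts)
```

Under this criterion the 8 Da uncertainty of TOF is too large for the 9.01 Da and 16.00 Da differences, whereas the 0.4 Da uncertainty of FTMS leaves all three cases distinguishable.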

[0415] The methods for DNA point mutation presented here are not only applicable to the analysis of single base
mutations, but also to less demanding detection of single or multiple base insertions or deletions, and quantification
of tandem two, three, or four base repeats. The PROBE reaction yields products amenable to analysis by relatively
low performance ESI or MALDI instrumentation; direct measurement of short amplified product masses is an even
more direct means of mutation detection, and will likely become more widespread with the increasing interest in high
performance MS available with FTMS.

EXAMPLE 16 

Immobilization of nucleic acids on solid supports via an acid-labile covalent bifunctional trityl linker 

[0416] Aminolinked DNA was prepared and purified according to standard methods. A portion (10 eq) was evaporated
to dryness on a speedvac and suspended in anhydrous DMF/pyridine (9:1; 0.1 ml). To this was added the chlorotrityl
chloride resin (1 eq, 1.05 µmol/mg loading) and the mixture was shaken for 24 hours. The loading was checked by
taking a sample of the resin, detritylating this using 80% AcOH, and measuring the absorbance at 260 nm. Loading
was ca. 150 pmol/mg resin.

[0417] In 80% acetic acid, the half-life of cleavage was found to be substantially less than 5 minutes; this compares
with trityl ether-based approaches with half-lives of 105 and 39 minutes for para and meta substituted bifunctional
dimethoxytrityl linkers respectively. Preliminary results have also indicated that the hydroxypicolinic acid matrix alone
is sufficient to cleave the DNA from the chlorotrityl resin.
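
To put the quoted half-lives side by side, the fraction of linker cleaved after a given time can be estimated. The sketch assumes simple first-order kinetics, which the source does not state, and uses 5 minutes as an upper bound for the chlorotrityl linker; the function name and the 30 minute time point are illustrative choices.

```python
def fraction_cleaved(t_min: float, half_life_min: float) -> float:
    """Fraction cleaved after t_min minutes, assuming first-order kinetics."""
    return 1.0 - 0.5 ** (t_min / half_life_min)

# Half-lives in 80% acetic acid quoted in the text (minutes).
half_lives = {
    "chlorotrityl (upper bound)": 5.0,
    "para-substituted dimethoxytrityl": 105.0,
    "meta-substituted dimethoxytrityl": 39.0,
}
for name, t_half in half_lives.items():
    print(name, round(fraction_cleaved(30.0, t_half), 3))
```

On this assumption, at least 98% of the chlorotrityl-linked DNA is released within 30 minutes, while the dimethoxytrityl linkers are still mostly intact.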

EXAMPLE 17

Immobilization of nucleic acids on solid supports via hydrophobic trityl linker

[0418] The primer contained a 5'-dimethoxytrityl group attached using routine trityl-on DNA synthesis.
[0419] C18 beads from an oligo purification cartridge (0.2 mg) placed in a filter tip were washed with acetonitrile, then
the solution of DNA (50 ng in 25 µl) was flushed through. This was then washed with 5% acetonitrile in ammonium
citrate buffer (70 mM, 250 µl). To remove the DNA from the C18, the beads were washed with 40% acetonitrile in water
(10 µl) and concentrated to ca. 2 µl on the Speedvac. The sample was then submitted to MALDI.

[0420] The results showed that acetonitrile/water at levels of ca. >30% are enough to dissociate the hydrophobic
interaction. Since the matrix used in MALDI contains 50% acetonitrile, the DNA can be released from the support and
successfully detected using MALDI-TOF MS (with the trityl group removed during the MALDI process).

[0421] Figure 69 is a schematic representation of nucleic acid immobilization via hydrophobic trityl linkers.

EXAMPLE 18 

Immobilization of nucleic acids on solid supports via Streptavidin-Iminobiotin

Experimental Procedure

[0422] 2-iminobiotin N-hydroxysuccinimide ester (Sigma) was conjugated to the oligonucleotides with a 3'- or 5'-amino
linker following the conditions suggested by the manufacturer. The completion of the reaction was confirmed by MALDI-
TOF MS analysis and the product was purified by reverse phase HPLC.

[0423] For each reaction, 0.1 mg of streptavidin-coated magnetic beads (Dynabeads M-280 Streptavidin from Dynal)
were incubated with 80 pmol of the corresponding oligo in the presence of 1 M NaCl and 50 mM ammonium carbonate
(pH 9.5) at room temperature for one hour. The beads bound with oligonucleotides were washed twice with 50 mM
ammonium carbonate (pH 9.5). Then the beads were incubated in 2 µl of 3-HPA matrix at room temperature for 2 min.
An aliquot of 0.5 µl of supernatant was applied to MALDI-TOF. For the biotin displacement experiment, 1.6 nmol of free
biotin (80-fold excess to the bound oligo) in 1 µl of 50 mM ammonium citrate was added to the beads. After a 5 min.
incubation at room temperature, 1 µl of 3-HPA matrix was added and 0.5 µl of supernatant was applied to MALDI-TOF
MS. To maximize the recovery of the bound iminobiotin oligo, the beads from the above treatment were again incubated
with 2 µl of 3-HPA matrix and 0.5 µl of supernatant was applied to MALDI-TOF MS. The matrix alone and the free biotin
treatment quantitatively released the iminobiotin oligo from the streptavidin beads as shown in Figures 70 and 71.

EXAMPLE 19 

Mutation Analysis Using Loop Primer Oligo Base Extension 
MATERIALS AND METHODS 

55 [0424] Genomic DNA. Genomic DNA was obtained from healthy individuals and patients suffering from sickle cell 
anemia. The wildtype and mutated sequences have been evaluated conventionally by standard Sanger sequencing. 
[0425] PCR-Amplification. PCR amplification of a part of the β-globin gene was established and optimized to use the
reaction product without a further purification step for capturing with streptavidin coated beads. The target amplification
for LOOP-PROBE reactions was performed with loop-cod5 d(GAG TCA GGT GCG CCA TGC CTC AAA CAG
ACA CCA TGG CGC, SEQ ID No. 58) as forward primer and β-11-bio d(TCT CTG TCT CCA CAT GCC CAG, SEQ
ID No. 59) as biotinylated reverse primer. The underlined nucleotide in the loop-cod5 primer is mutated to introduce
an invariant CfoI restriction site into the amplicon and the nucleotides in italics are complementary to a part of the
amplified product. The total PCR volume was 50 µl including 200 ng genomic DNA, 1 U Taq-polymerase (Boehringer-
Mannheim, Cat# 1596594), 1.5 mM MgCl2, 0.2 mM dNTPs (Boehringer-Mannheim, Cat# 1277049), and 10 pmol of
each primer. A specific fragment of the β-globin gene was amplified using the following cycling condition: 5 min at 94°C
followed by 40 cycles of: 30 sec @ 94°C, 30 sec @ 56°C, 30 sec @ 72°C, and a final extension of 2 min at 72°C.
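
The cycling condition can be written down as a small data structure, which also gives the total programmed time. This is an illustrative sketch: the class and function names are hypothetical, ramp times between steps are ignored, and the annealing temperature is read as 56°C from a garbled value in the source (it does not affect the total).

```python
from dataclasses import dataclass

@dataclass
class Step:
    temp_c: float    # hold temperature in degrees Celsius
    seconds: float   # hold time in seconds

def program_seconds(initial: Step, cycle: list[Step], n_cycles: int, final: Step) -> float:
    """Total programmed hold time; ramping between temperatures is ignored."""
    return initial.seconds + n_cycles * sum(s.seconds for s in cycle) + final.seconds

total = program_seconds(
    initial=Step(94, 5 * 60),                               # initial denaturation
    cycle=[Step(94, 30), Step(56, 30), Step(72, 30)],       # denature, anneal, extend
    n_cycles=40,
    final=Step(72, 2 * 60),                                 # final extension
)
print(total / 60, "min")  # 67.0 min
```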
[0426] Capturing and denaturation of biotinylated templates. 10 µl paramagnetic beads coated with streptavidin
(10 mg/ml; Dynal, Dynabeads M-280 streptavidin Cat# 112.06) and treated with 5x binding solution (5 M NH4Cl, 0.3 M
NH4OH) were added to 40 µl PCR volume (10 µl of the amplified product was saved for check electrophoresis). After
incubation for 30 min at 37°C the supernatant was discarded. The captured templates were denatured with 50 µl 100
mM NaOH for 5 min at ambient temperature, then washed once with 50 µl 50 mM NH4OH and three times with 100 µl
10 mM Tris-Cl, pH 8.0. The single stranded DNA served as templates for PROBE reactions.

[0427] Primer oligo base extension (PROBE) reaction. The PROBE reactions were performed using Sequenase 2.0
(USB Cat# E70775Z including buffer) as enzyme and dNTPs and ddNTPs supplied by Boehringer-Mannheim (Cat#
1277049 and 1008382). The ratio between dNTPs (dCTP, dGTP, dTTP) and ddATP was 1:1 and the total used
concentration was 50 µM of each nucleotide. After addition of 5 µl 1-fold Sequenase-buffer the beads were incubated for
5 min at 65°C and for 10 min at 37°C. During this time the partially self complementary primer annealed with the target
site. The enzymatic reaction started after addition of 0.5 µl 100 mM dithiothreitol (DTT), 3.5 µl dNTP/ddNTP solution,
and 0.5 µl Sequenase (0.8 U) and was incubated at 37°C for 10 min. Hereafter, the beads were washed once in 1-fold TE
buffer (10 mM Tris, 1 mM EDTA, pH 8.0).

[0428] CfoI restriction digest. The restriction enzyme digest was performed in a total volume of 5 µl using 10 U CfoI
in 1-fold buffer L purchased from Boehringer-Mannheim. The incubation time was 20 min at 37°C.

Conditioning of the diagnostic products for mass spectrometric analysis

[0429] After the restriction digest, the supernatant was precipitated in 45 µl H2O, 10 µl 3 M NH4-acetate (pH 6.5),
0.5 µl glycogen (10 mg/ml in water, Sigma, Cat# G1765), and 110 µl absolute ethanol for 1 hour at room temperature.
After centrifugation at 13,000 g for 10 min the pellet was washed in 70% ethanol and resuspended in 2 µl 18 Mohm/
cm H2O. The beads were washed in 100 µl 0.7 M NH4-citrate followed by 100 µl 0.05 M NH4-citrate. The diagnostic
products were obtained by heating the beads in 2 µl 50 mM NH4OH at 80°C for 2 min.

Sample preparation and analysis on MALDI-TOF mass spectrometry. 

[0430] Sample preparation was performed by mixing 0.6 µl of matrix solution (0.7 M 3-hydroxypicolinic acid, 0.07 M
dibasic ammonium citrate in 1:1 H2O:CH3CN) with 0.3 µl of either resuspended DNA/glycogen pellet or supernatant
after heating the beads in 50 mM NH4OH on a sample target and allowed to air dry. The sample target was automatically
introduced into the source region of an unmodified PerSeptive Voyager MALDI-TOF operated in delayed extraction
linear mode with 5 and 20 kV on the target and conversion dynode, respectively. Theoretical molecular masses (Mr(calc))
were calculated from atomic compositions; reported experimental (Mr(exp)) values are those of the singly-protonated
form.

RESULTS 

[0431] The LOOP-PROBE has been applied to the detection of the most common mutation of codon 6 of the human
β-globin gene leading to sickle cell anemia. The single steps of the method are schematically presented in figure 72.
For the analysis of codon 6, a part of the β-globin gene was amplified by PCR using the biotinylated reverse primer
β-11-bio and the primer loop-cod5, which is modified to introduce a CfoI recognition site (fig. 72a). The amplified product
is 192 bp in length. After PCR the amplification product was bound to streptavidin coated paramagnetic particles as
described above. The antisense strand was isolated by denaturation of the double stranded amplified product (Fig.
72b). The intra-molecular annealing of the complementary 3' end was accomplished by a short heat denaturation step
and incubation at 37°C. The 3' end of the antisense strand is now partially double stranded (Fig. 72c). For analyzing
the DNA downstream of the self annealed 3'-end of the antisense strand, the primer oligo base extension (PROBE)
has been performed using ddATP, dCTP, dGTP, dTTP (Fig. 72d). This generates products of different length specific
for the genotype of the analyzed individual. Before the determination of the length of these diagnostic products, the
DNA was incubated with the CfoI restriction endonuclease that cuts 5' of the extended product. This step frees the
stem loop from the template DNA whereas the extended product remains attached to the template. The extended
products are then denatured from the template strand by heating and analyzed by MALDI-TOF mass spectrometry.
[0432] Since the MALDI-TOF analyses were performed with a non-calibrated instrument, the mass deviation between
observed and expected values was approximately 0.6% higher than theoretically calculated. Nevertheless, the results
obtained were conclusive and reproducible within repeated experiments. In all analyzed supernatants after the
restriction digest the stem loop could be detected. Independent of the genotype, the stem loop had in all analyses
molecular masses of about 8150 Da (expected 8111 Da). An example is shown in Figure 73a. The second peak in this figure
with a mass of 4076 Da is a doubly charged ion of the stem loop. Figures 73b to 73d show the analyses of different
genotypes as indicated in the respective inserts. HbA is the wildtype genotype and HbC and HbS are two different
mutations in codon 6 of the β-globin gene which cause sickle cell disease. In the wildtype situation a single peak with
a molecular mass of 4247 Da and another with 6696 Da are detected (Fig. 73b). The latter corresponds to the
biotinylated PCR primer (β-11-bio) unused in the PCR reaction which also has been removed in some experiments. The
former corresponds to the diagnostic product for HbA. The analyses of the two individual DNA molecules with HbS
trait as well as compound heterozygosity (HbS/HbC) for the sickle cell disorder lead also to unambiguous expected
results (Fig. 73c and 73d).
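
The assignment of the 4076 Da peak as the doubly charged stem loop can be checked with a short calculation. The sketch assumes that the observed 8150 Da value refers to the singly protonated ion, as stated for experimental masses in the methods; the function name is hypothetical and the proton mass is a standard constant, not a value from the source.

```python
PROTON = 1.00728  # Da, mass of the charge-carrying proton (assumed constant)

def mz(neutral_mass: float, charge: int) -> float:
    """m/z of the [M + zH]z+ ion for a given neutral mass and charge z."""
    return (neutral_mass + charge * PROTON) / charge

stem_loop_mh = 8150.0             # observed singly protonated stem loop
neutral = stem_loop_mh - PROTON   # back out the neutral mass
print(round(mz(neutral, 2)))      # 4076
```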

[0433] In conclusion, the LOOP-PROBE is a powerful means for detection of mutations, especially predominant
disease causing mutations or common polymorphisms. The technique eliminates one specific reagent for mutation
detection and, therefore, simplifies the process and makes it more amenable to automation. The specific extended product
that is analyzed is cleaved off from the primer and is therefore shorter compared to the conventional method. In addition,
the annealing efficiency is higher compared to annealing of an added primer and should therefore generate more
product. The process is compatible with multiplexing and various detection schemes (e.g., single base extension, oligo
base extension and sequencing). For example, the extension of the loop-primer can be used for generation of short
diagnostic sequencing ladders within highly polymorphic regions to perform, for example, HLA typing or resistance as
well as species typing (e.g., Mycobacterium tuberculosis).

EXAMPLE 20

T7-RNA Polymerase Dependent Amplification of CKR-5 and Detection by MALDI-MS

MATERIALS AND METHODS

[0434] Genomic DNA. Human genomic DNA was obtained from healthy individuals. 

[0435] PCR-Amplification and Purification. PCR amplification of a part of the CKR-5 gene was accomplished using
ckrT7f as sense primer d(ACC TAG CGT TCA GTT CGA CTG AGA TAA TAC GAC TCA CTA TAG GAG CTC TCA
TTT TCC ATA C) (SEQ ID NO. 60). The underlined sequence corresponds to the sequence homologous to CKR-5, the
bolded sequence corresponds to the T7-RNA polymerase promoter sequence and the italic sequence was chosen
randomly. ckr5r was used as antisense primer d(AAG TAA GGC ATG TGG AGA AGA) (SEQ ID NO. 61). Purification of
the amplified product and removal of unincorporated nucleotides was carried out using the QIAquick purification kit
(Qiagen, cat# 28104). The final PCR volume of 50 µl contained 200 ng genomic DNA, 1 U Taq-polymerase (Boehringer-
Mannheim, cat# 1596594), 1.5 mM MgCl2, 0.2 mM dNTPs (Boehringer-Mannheim, cat# 1277049), and 10 pmol of each
primer. The specific fragment of the CKR-5 gene was amplified using the following cycling conditions: 5 min @ 94°C
followed by 40 cycles of 45 sec @ 94°C, 45 sec @ 52°C, 5 sec @ 72°C, and a final extension of 5 min @ 72°C.
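
The reaction set-up above can be checked with a few lines of arithmetic. The following Python sketch is illustrative only: it converts the stated primer amount into a final concentration and adds up the programmed hold times of the cycling protocol. The function and variable names are assumptions introduced here, and ramp times between temperature steps are not considered.

```python
# Reaction set-up values taken from the paragraph above.
REACTION_VOLUME_UL = 50.0
PRIMER_PMOL = 10.0

# Cycling programme as (label, seconds, repeats).
PROGRAMME = [
    ("initial denaturation, 94 C", 5 * 60, 1),
    ("denaturation, 94 C", 45, 40),
    ("annealing, 52 C", 45, 40),
    ("extension, 72 C", 5, 40),
    ("final extension, 72 C", 5 * 60, 1),
]


def primer_concentration_um(pmol, volume_ul):
    """pmol per microlitre is numerically equal to micromolar."""
    return pmol / volume_ul


def programmed_hold_seconds(programme):
    """Sum of programmed hold times; ramp times are not included."""
    return sum(seconds * repeats for _, seconds, repeats in programme)


primer_um = primer_concentration_um(PRIMER_PMOL, REACTION_VOLUME_UL)
hold_s = programmed_hold_seconds(PROGRAMME)
print(f"each primer: {primer_um:.2f} uM")
print(f"programmed hold time: {hold_s} s ({hold_s / 60:.1f} min)")
```

Under these assumptions each primer is present at 0.2 µM and the programme holds for 4400 s in total.
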
[0436] T7-RNA Polymerase conditions. One third of the purified DNA (about 60 ng) was used in the T7-RNA polymerase
reaction (Boehringer-Mannheim, cat# 881 767). The reaction was carried out for 2 h at 37°C according to the
manufacturer's conditions using the included buffer. The final reaction volume was 20 µl; 0.7 µl RNasin (33 U/µl) had
been added. After the extension reaction, the enzyme was inactivated by incubation for 5 min at 65°C.

DNA digestion and conditioning of the diagnostic products for mass spec analysis.

[0437] The template DNA was digested by adding RNase-free DNase I (Boehringer-Mannheim, cat# 776 758) to
the inactivated T7 mixture and incubation for 20 min at room temperature. Precipitation was carried out by adding 1
µl glycogen (10 mg/ml, Sigma, cat# G1765), 1/10 volume 3 M NH4-acetate (pH 6.5), and 3 volumes absolute ethanol
and incubation for 1 hour at room temperature. After centrifugation at 13,000 g for 10 min, the pellet was washed in
70% ethanol and resuspended in 3 µl 18 Mohm/cm H2O. 1 µl was analyzed on an agarose gel.

Sample preparation and analysis on MALDI-TOF mass spectrometry

[0438] Sample preparation was performed by mixing 0.6 µl of matrix solution (0.7 M 3-hydroxypicolinic acid, 0.07 M
dibasic ammonium citrate in 1:1 H2O:CH3CN) with 0.3 µl of resuspended DNA/glycogen on a sample target and allowed
to air dry. The sample target was introduced into the source region of an unmodified Finnigan VISION2000 MALDI-TOF
operated in reflectron mode with 5 kV. The theoretical molecular mass was calculated from atomic composition;
reported experimental values are those of the singly-protonated form.

RESULTS

[0439] The chemokine receptor CKR-5 has been identified as a major coreceptor in HIV-1 (see e.g., WO 96/39437
to Human Genome Sciences; Cohen, J. et al. Science 275:1261). A mutant allele that is characterized by a 32 bp
deletion is found in 16% of the HIV-1 seronegative population whereas the frequency of this allele is 35% lower in the
HIV-1 seropositive population. It is assumed that individuals homozygous for this allele are resistant to HIV-1. The
T7-RNA polymerase dependent amplification was applied to identify this specific region of the chemokine receptor
CKR-5 (Figure 74). Human genomic DNA was amplified using conventional PCR. The sense primer has been modified
so that it contains a random sequence of 24 bases that facilitate polymerase binding and the T7-RNA polymerase
promoter sequence (Figure 75). The putative start of transcription is at the first base 5' of the promoter sequence. ckr5r
was used as an antisense primer. PCR conditions are outlined above. The amplified product derived from wildtype
alleles is 75 bp in length. Primer and nucleotides were separated from the amplification product using the Qiagen
QIAquick purification kit. One third of the purified product was applied to in vitro transcription with T7-RNA polymerase.
To circumvent interference of the template DNA, it was digested by adding RNase-free DNase I. RNA was precipitated,
and this step also leaves the degraded DNA in the supernatant. Part of the redissolved RNA was analyzed on an
agarose gel and the rest of the sample was prepared for MALDI-TOF analysis. The expected calculated mass of the
product is 24560 Da. A dominant peak that corresponds to an approximate mass of 25378.5 Da can be observed.
Since the peak is very broad, an accurate determination of molecular mass was not possible. The peak does not
correspond to residual DNA template: first, the template DNA is digested, and second, the DNA strands would have
a mass of 23036.0 and 23174 Da, respectively.
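
The mass argument above can be made explicit by comparing the observed peak with each candidate species. The Python sketch below is a minimal illustration using only the four masses quoted in the text; the labels and the helper function are assumptions introduced here and are not part of the described method.

```python
# Masses (Da) quoted in the paragraph above.
EXPECTED_RNA = 24560.0
OBSERVED_PEAK = 25378.5
DNA_STRANDS = {"DNA strand 1": 23036.0, "DNA strand 2": 23174.0}


def deviation(observed, reference):
    """Return (absolute difference in Da, relative difference in percent)."""
    diff = observed - reference
    return diff, 100.0 * diff / reference


candidates = {"expected RNA transcript": EXPECTED_RNA, **DNA_STRANDS}
for name, mass in candidates.items():
    diff, pct = deviation(OBSERVED_PEAK, mass)
    print(f"{name}: {diff:+.1f} Da ({pct:+.2f} %)")

# The candidate whose mass lies closest to the observed peak.
closest = min(candidates, key=lambda k: abs(OBSERVED_PEAK - candidates[k]))
print("closest candidate:", closest)
```

With these numbers the observed peak lies about 818.5 Da above the expected transcript mass and more than 2200 Da above either DNA strand, which is consistent with the assignment made in the text given the breadth of the peak.
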

[0440] This example shows that T7 RNA polymerase can effectively amplify target DNA. The generated RNA can
be detected by mass spectrometry. In conjunction with modified (e.g., 3'-deoxy)ribonucleotides that are specifically
incorporated by an RNA polymerase but not extended any further, this method can be applied to determine the sequence
of a template DNA.

EXAMPLE 21

MALDI Mass Spectrometry of RNA Endonuclease Digests

MATERIALS

[0441] Synthetic RNA samples (sample A: 5'-UCCGGUCUGAUGAGUCCGUGAGGAC-3' (SEQ ID NO. 62); sample B:
5'-GUCACUACAGGUGAGCUCCA-3' (SEQ ID NO. 63); sample C: 5'-CCAUGCGAGAGUAAGUAGUA-3' (SEQ ID NO. 64))
were obtained from DNA Technology (Aarhus, Denmark) and purified on a denaturing polyacrylamide gel (Shaler,
T. A. et al. (1996) Anal. Chem. 68:576-579). RNases T1 (Eurogentec), U2 (Calbiochem), A (Boehringer-Mannheim)
and PhyM (Pharmacia) were used without additional purification. Streptavidin-coated magnetic beads (Dynabeads
M-280 Streptavidin, Dynal) were supplied as a suspension of 6-7 x 10^8 beads/ml (10 mg/ml) in phosphate-
buffered saline (PBS) containing 0.1% BSA and 0.02% NaN3. 3-Hydroxypicolinic acid (3-HPA) (Aldrich) was purified
by a separate desalting step before use as described in more detail elsewhere (Little, D. P. et al. (1995) Proc. Natl.
Acad. Sci. U.S.A. 92, 2318-2322).

METHODS

[0442] In vitro transcription reaction. The 5'-biotinylated 49 nt in vitro transcript (SEQ ID NO. 65):

AGGCCUGCGGCAAGACGGAAAGACCAUGGUCCCUNAUCUGCCGCAGGAUC

was produced by transcription of the plasmid pUTMS2 (linearized with the restriction enzyme BamHI) with T7 RNA
polymerase (Promega). For the transcription reaction 3 µg template DNA and 50 u T7 RNA polymerase were used in
a 50 µl volume of 1 u/µl RNA guard (RNase inhibitor, Pharmacia), 0.5 mM NTPs, 1.0 mM 5'-biotin-ApG dinucleotide, 40
mM Tris-HCl (pH 8.0), 6 mM MgCl2, 2 mM spermidine and 10 mM DTT. Incubation was performed at 37°C for 1 hour,
then another aliquot of 50 units T7 RNA polymerase was added and incubation was continued for another hour. The
mixture was adjusted to 2 M NH4-acetate and the RNA was precipitated by addition of one volume of ethanol and one
volume of isopropanol. The precipitated RNA was collected by centrifugation at 20,000 x g for 90 min at 4°C, the pellet
was washed with 70% ethanol, dried and redissolved in 8 M urea. Further purification was achieved by electrophoresis
through a denaturing polyacrylamide gel as described elsewhere (Shaler, T. A. et al. (1996) Anal. Chem. 68:576-579).
The ratio of 5'-biotinylated to non-biotinylated transcripts was about 3:1.

[0443] Ribonuclease assay. For partial digestion with selected RNases, different enzyme concentrations and assay
conditions were employed as summarized in Table VII. The solvents for each enzyme were selected following the
suppliers' instructions. The concentrations of the synthetic RNA samples and the in vitro transcript were adjusted to
5-10 x 10^-6 M.
[0444] The reaction was stopped at selected times by mixing 0.6 µl aliquots of the assay with 1.5 µl of 3-HPA solution.
The solvent was subsequently evaporated in a stream of cold air for the MALDI-MS analysis.

[0445] Limited alkaline hydrolysis was performed by mixing equal volumes (2.0 µl) of 25% ammonium hydroxide and
RNA sample (5-10 x 10^-6 M) at 60°C. 1 µl aliquots were taken out at selected times and dried in a stream of cold air.
For these samples it turned out to be important to first dry the digests in a stream of cold air, before 1.5 µl of the matrix
solution and 0.7 µl of NH4+-loaded cation exchange polymer beads were added.

[0446] The reaction was stopped at selected times by mixing 0.6 µl aliquots of the assay with 1.5 µl of 3-HPA solution.
The solvent was subsequently evaporated in a stream of cold air for the MALDI-MS analysis.

[0447] Limited alkaline hydrolysis was performed by mixing equal volumes (2.0 µl) of 25% ammonium hydroxide and
RNA sample (5-10 x 10^-6 M) at 60°C. 1 µl aliquots were taken out at selected times and dried in a stream of cold air.
For these samples it turned out to be important to first dry the digests in a stream of cold air, before 1.5 µl of the matrix
solution and 0.7 µl of a suspension of NH4+-loaded cation exchange polymer beads were added.
[0448] Separation of 5'-biotinylated fragments. Streptavidin-coated magnetic beads were utilized to separate
5'-biotinylated fragments of the in vitro transcript after partial RNase degradation. The biotin moiety in this sample was
introduced during the transcription reaction initiated by the 5'-biotin-pApG-dinucleotide. Prior to use, the beads were washed
twice with 2 x binding & washing (b&w) buffer (20 mM Tris-HCl, 2 mM EDTA, 2 M NaCl, pH 8.2) and resuspended at
10 mg/ml in 2 x b&w buffer. Circa 25 pmol of the RNA in vitro transcript were digested by RNase U2 using the protocol
described above. The digestion was stopped by adding 3 µl of 95% formamide containing 10 mM
trans-1,2-diaminocyclohexane-N,N,N',N'-tetraacetic acid (CDTA) at 90°C for 5 min, followed by cooling on ice. Subsequently, capture
of the biotinylated fragments was achieved by incubation of 6 µl of the digest with 6 µl of the bead suspension and 3 µl
of b&w buffer at room temperature for 15 min. Given the binding capacity of the beads of 200 pmol of biotinylated
oligonucleotide per mg of beads, as specified by the manufacturer, the almost 2-times excess of oligonucleotide was
used to assure a full loading of the beads. The supernatant was removed, and the beads were washed twice with 6 µl
of H2O. The captured fragments were eluted with CDTA and 95% formamide at 90°C for 5 min. After evaporation of the
solvent and the formamide the ^.5 pmol of fragments were resuspended in 2 µl H2O and analyzed by MALDI-MS as
described above.
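
The statement about an almost 2-times excess of oligonucleotide can be followed from the quantities given above. The Python sketch below is illustrative only; it assumes, as an upper bound, that all of the roughly 25 pmol of digested transcript is offered to the beads, which the text does not state explicitly. The names used are assumptions introduced here.

```python
# Values quoted in the paragraph above.
BEAD_STOCK_MG_PER_ML = 10.0      # bead suspension concentration
BEAD_VOLUME_UL = 6.0             # bead suspension used per capture
CAPACITY_PMOL_PER_MG = 200.0     # manufacturer's binding capacity
RNA_DIGESTED_PMOL = 25.0         # transcript put into the digest


def bead_capacity_pmol(volume_ul, stock_mg_per_ml, capacity_pmol_per_mg):
    """Binding capacity of a given volume of bead suspension."""
    bead_mass_mg = volume_ul * 1e-3 * stock_mg_per_ml
    return bead_mass_mg * capacity_pmol_per_mg


capacity = bead_capacity_pmol(
    BEAD_VOLUME_UL, BEAD_STOCK_MG_PER_ML, CAPACITY_PMOL_PER_MG
)
# Upper bound on the excess: assumes every digested molecule is offered to the beads.
excess = RNA_DIGESTED_PMOL / capacity
print(f"bead capacity: {capacity:.1f} pmol, excess: {excess:.2f}-fold")
```

Under these assumptions 6 µl of bead suspension binds about 12 pmol, so 25 pmol of transcript corresponds to roughly a 2-fold excess.
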

[0449] Sample preparation for MALDI-MS. 3-Hydroxypicolinic acid (3-HPA) was dissolved in ultra pure water to a
concentration of ca. 300 mM. Metal cations were exchanged against NH4+ as described in detail previously (Little, D.
P. et al. (1995) Proc. Natl. Acad. Sci. U.S.A. 92:2318-2322). Aliquots of 0.6 µl of the analyte solution were mixed with
1.5 µl 3-HPA on a flat inert metal substrate. Remaining alkali cations, present in the sample solution as well as on the
substrate surface, were removed by the addition of 0.7 µl of the solution of NH4+-loaded cation exchange polymer
beads. During solvent evaporation, the beads accumulated in the center of the preparation, were not used for the
analysis, and were easily removed with a pipette tip.

[0450] Instrument. A prototype of the Vision 2000 (ThermoBioanalysis, Hemel Hempstead, UK) reflectron time of
flight mass spectrometer was used for the mass spectrometry. Ions were generated by irradiation with a
frequency-tripled Nd:YAG laser (355 nm, 5 ns; Spektrum GmbH, Berlin, Germany) and accelerated to 10 keV. Delayed ion
extraction was used for the acquisition of the spectra shown, as it was found to substantially enhance the signal to
noise ratio and/or signal intensity. The equivalent flight path length of the system is 1.7 m, the base pressure is 10^^
Pa. Ions were detected with a discrete dynode secondary-electron multiplier (R2362, Hamamatsu Photonics), equipped
with a conversion dynode for effective detection of high mass ions. The total impact energy of the ions on the conversion
dynode was adjusted to values ranging from 16 to 25 keV, depending on the mass to be detected. The preamplified
output signal of the SEM was digitized by a LeCroy 9450 transient recorder (LeCroy, Chestnut Ridge, NY, USA) with
a sampling rate of up to 400 MHz. For storage and further evaluation, the data were transferred to a personal computer
equipped with custom-made software (ULISSES). All spectra shown were taken in the positive ion mode. Between 20
and 30 single shot spectra were averaged for each of the spectra shown.
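
To give a feeling for the time scales implied by these instrument parameters, the Python sketch below estimates the flight time of a singly charged ion and the mass increment covered by one digitizer sample. It is a simplified illustration, not a model of the instrument: the ion is assumed to travel the whole 1.7 m equivalent path with the full 10 keV kinetic energy, and delayed extraction and reflectron details are ignored. The function names are assumptions introduced here.

```python
import math

ELEMENTARY_CHARGE_C = 1.602176634e-19   # coulomb
DALTON_KG = 1.66053906660e-27           # kilogram per dalton


def flight_time_s(mass_da, accel_kv=10.0, path_m=1.7, charge=1):
    """Drift time of an ion over an equivalent field-free path.

    Idealized: the ion is assumed to carry its full kinetic energy z*e*U
    over the whole path; delayed extraction and reflectron details are ignored.
    """
    energy_j = charge * ELEMENTARY_CHARGE_C * accel_kv * 1e3
    velocity = math.sqrt(2.0 * energy_j / (mass_da * DALTON_KG))
    return path_m / velocity


def mass_per_sample_da(mass_da, sample_rate_hz=400e6, **kwargs):
    """Mass increment covered by one digitizer sample at a given mass.

    From t proportional to sqrt(m): dm = 2 * m * dt / t.
    """
    dt = 1.0 / sample_rate_hz
    return 2.0 * mass_da * dt / flight_time_s(mass_da, **kwargs)


for m in (1000.0, 8000.0, 16000.0):
    print(f"{m:7.0f} Da: {flight_time_s(m) * 1e6:6.1f} us, "
          f"{mass_per_sample_da(m):.3f} Da per sample")
```

Because the flight time scales with the square root of the mass, the mass interval covered by one 2.5 ns sample grows with mass, which is one reason why digitizer speed matters for the larger fragments.
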

RESULTS

[0451] Specificity of RNases. Combining base-specific RNA cleavage with MALDI-MS requires reaction conditions
optimized to retain the activity and specificity of the selected enzymes on the one hand and complying with the boundary
conditions for MALDI on the other. Incompatibility mainly results because the alkali-ion buffers commonly used in
the described reaction, such as Na-phosphate, Na-citrate or Na-acetate, as well as EDTA, interfere with the MALDI
sample preparation; presumably they disturb the matrix crystallization and/or analyte incorporation. Tris-HCl or
ammonium salt buffers, in contrast, are MALDI compatible (Shaler, T. A. et al. (1996) Anal. Chem. 68:576-579). Moreover,
alkali salts in the sample lead to the formation of a heterogeneous mixture of multiple salts of the analyte, a problem
increasing with increasing number of phosphate groups. Such mixtures result in loss of mass resolution and accuracy
as well as signal-to-noise ratio (Little, D. P. et al. (1995) Proc. Natl. Acad. Sci. U.S.A. 92:2318-2322; Nordhoff, E.,
Cramer, R., Karas, M., Hillenkamp, F., Kirpekar, F., Kristiansen, K. and Roepstorff, P. (1993) Nucleic Acids Res., 21,
3347-3357). Therefore, RNase digestions were carried out under somewhat modified conditions compared to the ones
described in the literature. They are summarized above in Table VII. For RNases T1, A, CL3 and cusativin, Tris-HCl (pH
6-7.5) was used as buffer. 20 mM DAC provides the pH of 5, recommended for maximum activity of RNases U2 and
PhyM. Concentrations of 10-20 mM of these compounds were found not to interfere significantly with the MALDI
analysis. To examine the specificity of the selected ribonucleases under these conditions, three synthetic 20-25mer
RNA molecules with different nucleotide sequences were digested.

[0452] The MALDI-MS spectra of Figure 77 show five different cleavage patterns (A-E) of a 25 nt RNA obtained
after partial digestion with RNases T1, U2, PhyM, A, and alkaline hydrolysis. These spectra were taken from aliquots
which were removed from the assay after empirically determined incubation times, chosen to get an optimum coverage
of the sequence. As the resulting samples were not fractionated prior to mass spectrometric analysis, they contain all
fragments generated at that time by the respective RNases. In practice, uniformity of the cleavages can be affected
by a preferential attack on specific phosphodiester bonds (Donis-Keller, H., Maxam, A. M., and Gilbert, W. (1977)
Nucleic Acids Res., 4, 1957-1978; Donis-Keller, H. (1980) Nucleic Acids Res., 8, 3133-3142). The majority of the
expected fragments are indeed observed in the spectra. It is also worth noting that for the reaction protocols as used,
correct assignment of all fragment masses is only possible if a 2',3'-cyclic phosphate group is assumed. It is well
known that such cyclic phosphates are intermediates in the cleavage reaction and get hydrolyzed in a second,
independent and slower reaction step involving the enzyme (Richards, F. M., and Wyckoff, H. W. in The Enzymes Vol. 4,
3rd Ed., (ed. Boyer, P.D.) 746-806 (1971, Academic Press, New York); Heinemann, U. and Saenger, W. (1985) Pure
Appl. Chem. 57, 417-422; Ikehara, M. et al. (1987) Pure Appl. Chem. 59, 965-968; Breslow, R. and Xu, R. (1993) Proc.
Natl. Acad. Sci. USA, 90, 1201-1207). In a few cases different fragments have equal mass or differ by as little as 1
Dalton. In these cases, mass peaks cannot unambiguously be assigned to one or the other fragment. Digestion of
two additional different 20 nt RNA samples was, therefore, performed (Hahner, S., Kirpekar, F., Nordhoff, E., Kristiansen,
K., Roepstorff, P. and Hillenkamp, F. (1996) Proceedings of the 44th ASMS Conference on Mass Spectrometry,
Portland, Oregon) in order to sort out these ambiguities. For all samples tested, the selected ribonucleases appear to
cleave exclusively at the specified nucleotides leading to fragments arising from single as well as multiple cleavages.
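
The role of the 2',3'-cyclic phosphate in the mass assignment can be illustrated with a small in-silico digest. The Python sketch below lists the fragments of sample A that retain the original 5'-terminus, together with their neutral average masses. It rests on several assumptions that are not taken from the text: the synthetic RNA is assumed to carry a 5'-hydroxyl, standard average residue masses are used, RNase U2 is treated as cutting after A only, and the function names are hypothetical. Measured singly protonated ions would appear about 1 Da higher.

```python
# Average masses (Da) of RNA residues within a chain (nucleoside monophosphate minus water).
RESIDUE_MASS = {"A": 329.21, "C": 305.18, "G": 345.21, "U": 306.17}

# Bases after which each enzyme cuts. T1, PhyM and A follow the text;
# "A" for U2 is an assumption consistent with the discussion of Figure 77.
SPECIFICITY = {"T1": "G", "U2": "A", "PhyM": "AU", "A": "CU"}

SAMPLE_A = "UCCGGUCUGAUGAGUCCGUGAGGAC"  # 25 nt synthetic RNA from the text


def cleavage_sites(seq, enzyme):
    """1-based positions of residues whose 3' phosphodiester bond can be cut."""
    bases = SPECIFICITY[enzyme]
    # The last residue has no 3' neighbour, so it is never a cleavage site.
    return [i + 1 for i, b in enumerate(seq[:-1]) if b in bases]


def cyclic_fragment_mass(fragment):
    """Neutral average mass of a 5'-OH fragment ending in a 2',3'-cyclic phosphate.

    With a 5'-hydroxyl and a cyclic 3' phosphate the end-group corrections
    cancel, so the mass is simply the sum of the residue masses.
    """
    return sum(RESIDUE_MASS[b] for b in fragment)


def five_prime_ladder(seq, enzyme):
    """(site, fragment, mass) for every fragment keeping the original 5' end."""
    return [(s, seq[:s], round(cyclic_fragment_mass(seq[:s]), 2))
            for s in cleavage_sites(seq, enzyme)]


for site, frag, mass in five_prime_ladder(SAMPLE_A, "T1"):
    print(f"G{site:<2d} {mass:8.2f} Da  {frag}")
```

A fragment with a hydrolyzed (open) 3'-phosphate would be heavier by the mass of one water molecule, about 18 Da, which is why the choice of end group matters for the assignment.
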
[0453] In Figure 77, peaks indicating fragments containing the original 5'-terminus are marked by arrows. All
non-marked peaks can be assigned to internal sequences or those with retained 3'-terminus. For a complete sequence all
possible fragments bearing exclusively either the 5'- or the 3'-terminus of the original RNA would suffice. In practice,
the 5'-fragments are better suited for this purpose, because the spectra obtained after incubation of all three synthetic
RNA samples contain the nearly complete set of original 5'-ions for all different RNases (Hahner, S., Kirpekar, F.,
Nordhoff, E., Kristiansen, K., Roepstorff, P. and Hillenkamp, F. (1996) Proceedings of the 44th ASMS Conference on Mass
Spectrometry, Portland, Oregon). Internal fragments are somewhat less abundant and fragments containing the original
3'-terminus appear suppressed in the spectra. In agreement with observations reported in the literature (Gupta, R. C.
and Randerath, K. (1977) Nucleic Acids Res., 4, 1957-1978), cleavages close to the 3'-terminus were partially
suppressed in partial digests of the RNA 25 mer by RNases T1 and U2 (even if they are internal or contain the original
5'-terminus). Fragments from such cleavages appear as weak and poorly resolved signals in the mass spectra.

[0454] For larger RNA molecules secondary structure is known to influence the uniformity of the enzymatic cleavages
(Donis-Keller, H., Maxam, A. M. and Gilbert, W. (1977) Nucleic Acids Res. 8, 3133-3142). This can, in principle, be
overcome by altered reaction conditions. In assay solutions containing 5-7 M urea, the activity of RNases such as T1,
U2, A, CL3, and PhyM is known to be retained (Donis-Keller, H., Maxam, A. M. and Gilbert, W. (1977) Nucleic Acids
Res., 4, 2527-2537; Boguski, M. S., Hieter, P. A., and Levy, C. C. (1980) J. Biol. Chem., 255, 2160-2163; Donis-Keller,
H. (1980) Nucleic Acids Res., 8, 3133-3142), while RNA is sufficiently denatured. UV-MALDI-analysis with 3-HPA as
matrix is not possible under such high concentrations of urea in the sample. Up to a concentration of 2 M urea in the
reaction buffer, MALDI analysis of the samples was still possible although significant changes in matrix crystallization
were observed. Spectra of the RNA 20 mer (sample B), digested in the presence of 2 M urea, still resembled those
obtained under conditions listed in Table VII.

[0455] Digestion by RNases which exclusively recognize one nucleobase is desirable to reduce the complexity of
the fragment patterns and thereby facilitate the mapping of the respective nucleobase. RNases CL3 and cusativin are
enzymes reported to cleave at cytidylic acid residues. Upon limited RNase CL3 and cusativin digestion of the RNA
20mer (sample B) under non-denaturing conditions, fragments corresponding to cleavages at cytidylic residues were
indeed observed (Figure 78), similar to the data reported so far (Boguski, M. S., Hieter, P. A. and Levy, C. C. (1980)
J. Biol. Chem., 255, 2160-2163; Rojo, M. A., Arias, F. J., Iglesias, R., Ferreras, J. M., Munoz, R., Escarmis, C., Soriano,
F., Lopez-Fando, J., Mendez, E., and Girbes, T. (1994) Planta, 194, 328-338). The degradation pattern in Figure 78,
however, reveals that not every cytidine residue is recognized, especially for neighboring C residues. RNase CL3 is
also reported to be susceptible to the influence of secondary structure (Boguski, M. S., Hieter, P. A., and Levy, C. C.
(1980) J. Biol. Chem., 255, 2160-2163), but for RNA of the size employed in this study, such an influence should be
negligible. Therefore, unrecognized cleavage sites in this case can be attributed to a lack of specificity of this enzyme.
To confirm these data, a further RNase CL3 digestion was performed with the RNA 20mer (sample C). As a result of
the sequence of this analyte, all three linkages containing cytidylic acid were readily hydrolyzed, but additional
cleavages at uridylic acid residues were detected as well. Since altered reaction conditions such as increased temperature
(90°C), various enzyme to substrate ratios, and addition of 2 M urea did not result in a digestion of the expected
specificity, application of this enzyme to sequencing was not pursued further. Introduction of a new cytidine-specific
ribonuclease, cusativin, isolated from dry seeds of Cucumis sativus L., looked promising for RNA sequencing (Rojo, M. A.,
Arias, F. J., Iglesias, R., Ferreras, J. M., Munoz, R., Escarmis, C., Soriano, F., Lopez-Fando, J., Mendez, E. and Girbes,
T. (1994) Planta, 194, 328-338). As shown in Figure 78, not every cytidine residue was hydrolyzed and additional
cleavages occurred at uridylic acid residues for the recommended concentration of the enzyme. RNases CL3 and
cusativin will, therefore, not yield the desired sequence information for mapping of cytidine residues and their use was
not further pursued. The distinction of pyrimidine residues can be achieved, however, by use of RNases with multiple
specificities, such as Physarum polycephalum RNase (cleaves ApN, UpN) and pancreatic RNase A (cleaves UpN,
CpN) (see Figure 77). All 5'-terminus fragments generated by the monospecific RNase U2 and apparent in the spectrum
of Figure 77C were also evident in the spectrum of the RNase PhyM digest (Figure 77D). Five of the six uridylic cleavage
sites could, this way, be uniquely identified by this indirect method. In a next step, the knowledge of the uridine cleavage
sites was used to identify sites of cleavage of cytidylic acid residues in the spectrum recorded after incubation with
RNase A (Figure 77E), again using exclusively ions containing the original 5'-terminus. Two of the four expected
cleavage sites were identified this way. A few limitations are apparent from these spectra if only the fragments containing
the original 5'-terminus are used for the sequence determination. The first two nucleotides usually escape the analysis,
because their signals get lost in the low mass matrix background. Because of this, the corresponding fragments are
missing in the spectra of the U- and C-specific cleavages. Large fragments with cleavage sites close to the 3'-terminus
are often difficult to identify, particularly in digests with RNases T1 and U2, because of their low yield (vide supra) and
the often strong nearby signal of the non-digested transcript. Accordingly the cleavages in position 22 and 23 do not
show up in the spectrum of the G-specific RNase T1 (Figure 77A) and the cleavage site 24 cannot be identified from
the spectra of the U2 and PhyM digests (Figures 77C and D). Also site 16 and 17 with two neighboring cytidylic acids
cannot be identified in the RNase A spectrum of Figure 77E. These observations demonstrate that a determination of
exclusively the 5'-terminus fragments may not always suffice and the information contained in the internal fragments
may be needed for a full sequence analysis.
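
The indirect identification of pyrimidine sites described above amounts to two set differences. The Python sketch below illustrates the logic on sample A using idealized, complete 5'-ladders; it is not a reconstruction of the measured spectra. RNase U2 is treated as cutting after A only, and the cutoff that removes the first two positions is an assumption introduced here to mimic the loss of the shortest fragments in the matrix background.

```python
SAMPLE_A = "UCCGGUCUGAUGAGUCCGUGAGGAC"  # 25 nt synthetic RNA from the text


def sites(seq, bases):
    """1-based positions whose 3' bond an enzyme cutting after `bases` can cleave."""
    return {i + 1 for i, b in enumerate(seq[:-1]) if b in bases}


# Idealized, complete 5'-ladders; real spectra miss some of these sites.
u2_sites = sites(SAMPLE_A, "A")        # assumed A-specific (Figure 77C)
phym_sites = sites(SAMPLE_A, "AU")     # ApN and UpN (Figure 77D)
rnase_a_sites = sites(SAMPLE_A, "CU")  # UpN and CpN (Figure 77E)

# Step 1: sites seen with PhyM but not with U2 must be uridines.
u_sites = phym_sites - u2_sites
# Step 2: sites seen with RNase A that are not uridines must be cytidines.
c_sites = rnase_a_sites - u_sites


def drop_unobservable(found, min_site=3):
    """Mimic the loss of the shortest fragments in the matrix background."""
    return {s for s in found if s >= min_site}


print("U sites:", sorted(u_sites))
print("C sites:", sorted(c_sites))
print("U sites above the low-mass cutoff:", sorted(drop_unobservable(u_sites)))
print("C sites above the low-mass cutoff:", sorted(drop_unobservable(c_sites)))
```

With the first two positions excluded, the sketch leaves five U sites and four C sites, two of which are the neighboring positions 16 and 17 discussed in the text.
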

[0456] Finally, limited alkaline hydrolysis provides a continuum of fragments (Figure 77B), which can be used to
complete the sequence data. Again, the spectrum is dominated by ions of fragments containing the 5'-terminus,
although the hydrolysis should be equal for all phosphodiester bonds. As was true for the enzymatic digests, correct
mass assignment requires one to assume that all fragments have a 2',3'-cyclic phosphate. The distribution of peaks,
therefore, resembles that obtained after a 3'-exonuclease digest (Pieles, U., Zurcher, W., Schar, M. and Moser, H. E.,
(1993) Nucleic Acids Res., 21, 3191-3196; Nordhoff, E. et al. (1993) Book of Abstracts, 13th Internat. Mass Spectrom.
Conf., Budapest p. 218; Kirpekar, F., Nordhoff, E., Kristiansen, K., Roepstorff, P., Lezius, A., Hahner, S., Karas, M. and
Hillenkamp, F. (1994) Nucleic Acids Res., 22, 3866-3870). In principle, the alkaline hydrolysis alone could, therefore,
be used for a complete sequencing. This is, however, only possible for quite small oligoribonucleotides, because larger
fragment ions, differing in mass by only a few mass units, will not be resolved in the spectra and the mass of larger
ions cannot be determined with the necessary accuracy of better than 1 Da, even if peaks are partially or fully resolved.
The interpretation of the spectra, particularly from digests of unknown RNA samples, is substantially simplified if only
the fragments containing the original 5'-terminus are separated out prior to the mass spectrometric analysis. A
procedure for this approach is described in the following section.
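
The requirement for a mass accuracy of better than 1 Da follows from the small mass difference between cytidine and uridine residues. The Python sketch below is an illustration only: it builds the ladder of 5'-terminal fragments for sample B and reads the sequence back from the spacing of neighbouring peaks. The residue masses are standard average values, a 5'-hydroxyl is assumed, and the 0.4 Da tolerance and the function names are arbitrary choices made here.

```python
# Average in-chain residue masses (Da): nucleoside monophosphate minus water.
RESIDUE_MASS = {"A": 329.21, "C": 305.18, "G": 345.21, "U": 306.17}


def ladder_masses(seq):
    """Masses of all 5'-terminal fragments with a 2',3'-cyclic phosphate.

    Assumes a 5'-hydroxyl, so each fragment mass is the running sum of residues.
    """
    total, masses = 0.0, []
    for base in seq:
        total += RESIDUE_MASS[base]
        masses.append(total)
    return masses


def read_ladder(masses, tolerance_da=0.4):
    """Call one base per step from the spacing of neighbouring ladder peaks."""
    calls = []
    for lighter, heavier in zip(masses, masses[1:]):
        step = heavier - lighter
        base, error = min(((b, abs(step - m)) for b, m in RESIDUE_MASS.items()),
                          key=lambda pair: pair[1])
        # C and U differ by about 1 Da, so a larger error makes the call unreliable.
        calls.append(base if error <= tolerance_da else "N")
    return "".join(calls)


SAMPLE_B = "GUCACUACAGGUGAGCUCCA"  # 20 nt synthetic RNA from the text
masses = ladder_masses(SAMPLE_B)
print(read_ladder(masses))  # bases 2 to 20, read from exact masses

# A 0.6 Da error on a single peak makes both adjacent calls fail the tolerance.
shifted = list(masses)
shifted[5] += 0.6
print(read_ladder(shifted))
```

The first residue cannot be read from peak spacings alone, and an error of well under 1 Da on one peak already invalidates the two neighbouring calls, which illustrates why the approach is limited to small oligoribonucleotides.
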

[0457] Separation of 5'-biotinylated fragments. Streptavidin-coated magnetic beads (Dynal) were tested for the
extraction of fragments containing the original 5'-terminus from the digests. Major features to be checked for this
solid-phase approach are the selective immobilization and efficient elution of biotinylated species. In preliminary experiments,
a 5'-biotinylated DNA (19 nt) and streptavidin were incubated and MALDI analyzed after standard preparation. Despite
the high affinity of the streptavidin-biotin interaction, the intact complex was not found in the MALDI spectra. Instead,
signals of the monomeric subunit of streptavidin and the biotinylated DNA were detected. Whether the complex
dissociates in the acidic matrix solution (pKa 3) or during the MALDI desorption process is not known. Surprisingly, if the
streptavidin is immobilized on a solid surface such as magnetic beads, the same results are not observed. A mixture
of two 5'-biotinylated DNA samples (19 nt and 27 nt) and two unlabeled DNA sequences (12 nt and 22 nt) was incubated
with the beads. The beads were extracted and carefully washed before incubation in the 3-HPA MALDI matrix. No
analyte signals could be obtained from these samples. To test whether the biotinylated species had been bound to the
beads at all, elution from the extracted and washed beads was performed by heating at 90°C in the presence of
95% formamide. This procedure is expected to denature the streptavidin, thereby breaking the streptavidin/biotin
complex. Figure 79B shows the expected signals of the two biotinylated species, proving that release of the bound molecules
in the MALDI process is the problem rather than the binding to the beads; Figure 79A shows a spectrum of the same
sample after standard preparation, showing signals of all four analytes as a reference. Complete removal of the
formamide after the elution and prior to the mass spectrometric analysis was found to be important, otherwise
crystallization of the matrix is disturbed. Mass resolution and the signal-to-noise ratio in spectrum 79B are comparable to
those of the reference spectrum. These results testify to the specificity of the streptavidin-biotin interaction, since no
or only minor signals of the non-biotinylated analyte were detected after incubation with the Dynal beads. Increased
suppression of nonspecific binding was reported through an addition of the detergent Tween-20 to the binding buffer
(Tong, X. and Smith, L. M. (1992) Anal. Chem., 64, 2672-2677). Although this effect could be confirmed in this study,
peak broadening affected the quality of the spectra due to remaining amounts of the detergent. The necessity of an
elution step as a prerequisite for detection of the captured biotinylated species can be attributed to a stabilization
of the complex by the immobilization of the streptavidin on the magnetic beads.

[0458] For practical application of this solid phase method to sequencing, a maximum efficiency of binding and elution
of biotinylated species is of prime importance. Among a variety of conditions investigated so far, addition of salts such
as EDTA gave best results in the case of DNA sequencing by providing ionic strength to the buffer (Tong, X. and Smith,
L. M. (1992) Anal. Chem., 64, 2672-2677). To examine such an effect on the solid-phase method, several salt additives
were tested for the binding and elution of the 5'-biotinylated RNA in vitro transcript (49 nt). The results are shown in
Figure 80. Judging from the relative intensity, signal-to-noise ratio, and resolution of the respective signals, a 95%
formamide solution containing 10 mM CDTA (Figure 80D) is most efficient for the binding/elution. Since CDTA acts as
a chelating agent for divalent cations, formation of proper secondary and tertiary structure of the RNA is prevented. An
improved sensitivity and spectral resolution has been demonstrated under such conditions for the analysis of RNA
samples by electrospray mass spectrometry (Limbach, P. A., Crain, P. F. and McCloskey, J. A. (1995) J. Am. Soc. Mass
Spectrom., 6, 27-39). The improvement in the MALDI analysis is actually not very significant compared to the spectrum
obtained for the solution containing formamide alone (Figure 81b), but the reproducibility for spectra of good quality
was substantially improved for the CDTA/formamide solution. Thus in addition to the improved binding/elution, this
additive may also improve the incorporation of the analyte into the matrix crystals. Unfortunately, a striking signal
broadening on the high mass side was observed in case of formamide solutions containing EDTA, CDTA or 25%
ammonium hydroxide. Since this effect is most prominent in case of 25% ammonium hydroxide and this agent was
also used for adjusting EDTA and CDTA to their optimum pH, a pronounced NH3 adduct ion formation can be assumed.
[0459] The applicability of streptavidin-coated magnetic bead separation to RNA sequencing was demonstrated for
the RNase U2 digest of the 5'-biotinylated RNA in vitro transcript (49 nt) (Figure 81). The entire fragment pattern obtained
after incubation with RNase U2 is shown in spectrum 81A. Separation of the biotinylated fragments reduces the
complexity of the spectrum (Figure 81B) since only 5'-terminal fragments are captured by the beads. The signals in the
spectrum are broadened and the increased number of signals in the low mass range indicates that even after stringent
washing of the beads, some amounts of buffer and detergent used for the binding and elution remained. Further
improvements of the method are, therefore, needed. Another possible strategy for application of the magnetic beads is
the immobilization of the target RNA prior to RNase digestion, followed by an elution of the remaining fragments for further
analysis. Cleavage of the RNA was impeded in this case, as evidenced by a prolonged reaction time for the digest
under otherwise identical reaction conditions.

EXAMPLE 22

Parallel DNA Sequencing Mutation Analysis and Microsatellite Analysis Using Primers with Tags and Mass
Spectrometric Detection

[0460] This EXAMPLE describes specific capturing of DNA products generated in DNA analysis. The capturing is
mediated by a specific tag (5 to 8 nucleotides long) at the 5' end of the analysis product that binds to a complementary
sequence. The capture sequence can be provided by a partially double stranded oligonucleotide bound to a solid
support. Different DNA analyses (e.g., sequencing, mutation, diagnostic, microsatellite analysis) can be carried out in
parallel, using, for example, a conventional tube or microtiter plate (MTP). The products are then specifically captured
and sorted out via the complementary identification sequence on the tag oligonucleotide. The capture oligonucleotide
can be bound onto a solid support (e.g., silicon chip) by a chemical or biological bond. Identification of the sample is
provided by the predefined position of the capture oligonucleotide. Purification, conditioning and analysis by mass
spectrometry are done on solid support. This method was applied for capturing specific primers that had a 6 base tag
sequence.

MATERIALS AND METHODS 

Genomic DNA. 

[0461] Genomic DNA was obtained from healthy individuals.

PCR Amplification

[0462] PCR amplifications of part of the β-globin gene were established using β2 d(CATTTGCTTCTGACACAACT
Seq. ID. No. 66) as forward primer and β11 d(TCTCTGTCTCCACATGCCCAG Seq. ID. No. 67) as reverse primer. The
total PCR volume was 50 µl including 200 ng genomic DNA, 1 U Taq-polymerase (Boehringer-Mannheim, Cat# 159594),
1.5 mM MgCl2, 0.2 mM dNTPs (Boehringer-Mannheim, Cat# 1277049), and 10 pmol of each primer. A specific fragment
of the β-globin gene was amplified using the following cycling conditions: 5 min @ 94°C followed by 40 cycles of 30
sec @ 94°C, 45 sec @ 53°C, 30 sec @ 72°C, and a final extension of 2 min @ 72°C. Purification of the amplified
product and removal of unincorporated nucleotides was carried out using the QIAquick purification kit (Qiagen, Cat#
28104). One fifth of the purified product was used for the primer oligo base extension (PROBE) or sequencing reactions,
respectively.

Primer oligo base extension (PROBE) and sequencing reactions

[0463] Detection of putative mutations in the human β-globin gene at codons 5 and 6 and at codon 30 and in the IVS-1
donor site, respectively, was done in parallel (FIGURE 82A). β-TAG1 (GTCGTCCCATGGTGCACGTGACTC Seq.
ID. No. 68) served as primer to analyze codons 5 and 6 and β-TAG2 (CGGTGTGGTGAGGCCGTGGGCA Seq. ID. No.
69) for the analyses of codon 30 and the IVS-1 donor site. The primer oligo base extension (PROBE) reaction was
done by cycling, using the following conditions: final reaction volume was 20 µl; β-TAG1 primer (5 pmol), β-TAG2 primer
(5 pmol), dCTP, dGTP, dTTP (final concentration each 25 µM), ddATP (final concentration 100 µM) (dNTPs and ddNTPs
purchased from Boehringer-Mannheim, Cat# 1277049 and 1008382), 2 µl of 10x ThermoSequenase buffer and 2.5 U
ThermoSequenase (Amersham, CAT#E79000Y). The cycling program was as follows: 5 min @ 94°C, 30 sec @ 53°C,
30 sec @ 72°C and a final extension step for 8 min @ 72°C. Sequencing was performed under the same conditions
except that the reaction volume was 25 µl and the concentration of nucleotides was 250 µM for ddNTP.

Capturing using TAG sequence and sample preparation 

[0464] The capture oligonucleotides cap-tag1 d(GACGACGACTGCTACCTGACTCCA Seq ID No. 70) and cap-tag2
d(ACAGCGGACTGCTACCTGACTCCA Seq ID No. 71), respectively, were annealed to equimolar amounts of uni-as
d(TGGAGTCAGGTAGCAGTC Seq ID No. 72) (FIGURE 82A). Each oligonucleotide had a concentration of 10 pmol/µl
in ddH2O and was incubated for 2 min @ 80°C and 5 min @ 37°C. This solution was stored at -20°C and aliquots were
taken. 10 pmol annealed capture oligonucleotides were bound to 10 µl paramagnetic beads coated with streptavidin
(10 mg/ml; Dynal, Dynabeads M-280 streptavidin Cat# 112.06) by incubation for 30 min @ 37°C. Beads were captured
and the PROBE or sequencing reaction, respectively, was added to the capture oligonucleotides. To facilitate binding
of β-TAG1 and β-TAG2, respectively, the reaction was incubated for 5 min @ 25°C and for 30 min @ 16°C. The beads
were washed twice with ice cold 0.7 M NH4 citrate to wash away nonspecifically bound extension products and primers.
The bound products were dissolved by adding 1 µl ddH2O, incubating for 2 min @ 65°C and cooling on ice. 0.3 µl
of the sample was mixed with 0.3 µl matrix solution (saturated 3-hydroxypicolinic acid, 10% molar ratio ammonium
citrate in acetonitrile/water (50/50, v/v)) and allowed to air dry. The sample target was automatically introduced into
the source region of an unmodified PerSeptive Voyager MALDI-TOF operated in delayed extraction linear mode with
5 and 20 kV on the target and conversion dynode, respectively. Theoretical average molecular masses (Mr(calc)) were
calculated from atomic compositions; reported experimental Mr (Mr(exp)) values are those of the singly-protonated form.

RESULTS 

[0465] Specific capturing of a mixture of extension products by a short complementary sequence has been applied
to isolate sequencing and primer oligo base extension (PROBE) products. This method was used for the detection of
putative mutations in the human β-globin gene at codons 5 and 6 and at codon 30 and the IVS-1 donor site, respectively
(FIGURE 82A). Genomic DNA was amplified using the primers β2 and β11. The amplification product was purified
and the nucleotides separated. One fifth of the purified product was used for analyses by primer oligo base extension.
To analyze both sites in a single reaction, the primers β-TAG1 and β-TAG2 were used. β-TAG1 binds upstream
of codons 5 and 6 and β-TAG2 upstream of codon 30 and the IVS-1 donor site. Extension of these primers was
performed by cycling in the presence of ddATP and dCTP, dGTP and dTTP, leading to specific products, depending on
the phenotype of the individual. The reactions were then mixed with the capture oligonucleotides. Capture
oligonucleotides include the biotinylated capture primers cap-tag1 and cap-tag2, respectively. They have 6 bases at the 5' end
that are complementary to the 5' end of β-TAG1 and β-TAG2, respectively. Therefore, they specifically capture these
primers and the extended products. By annealing a universal oligonucleotide (uni-as) to the capture oligonucleotide,
the capture primer is transformed into a partially double stranded molecule where only the capture sequence stays
single stranded (Figure 82). This molecule is then bound to streptavidin coated paramagnetic particles, to which the
PROBE or sequencing reaction, respectively, is added. The mixture was washed so that only the specifically annealed
oligonucleotides remained bound. Captured oligonucleotides are dissolved and analyzed by mass spectrometry.
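
The complementarity on which this capture scheme relies can be checked directly from the sequences given in the Materials and Methods. The following is a minimal sketch only; the function and variable names are illustrative assumptions and are not part of the described method. It verifies that the six 5'-terminal bases of cap-tag1 are the reverse complement of the 5' tag of the first tagged primer (Seq. ID. No. 68), and that uni-as pairs with the remainder of each capture oligonucleotide, leaving only the capture sequence single stranded.

```python
# Illustrative check of tag/capture complementarity; all names are hypothetical.
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}

# Sequences as listed in the text, written 5'->3'.
TAG1 = "GTCGTCCCATGGTGCACGTGACTC"      # tagged PROBE primer, Seq. ID. No. 68
CAP_TAG1 = "GACGACGACTGCTACCTGACTCCA"  # capture oligonucleotide, Seq ID No. 70
CAP_TAG2 = "ACAGCGGACTGCTACCTGACTCCA"  # capture oligonucleotide, Seq ID No. 71
UNI_AS = "TGGAGTCAGGTAGCAGTC"          # universal oligonucleotide, Seq ID No. 72

TAG_LENGTH = 6  # length of the single-stranded capture sequence (6 base tag)


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def binds_tag(capture_oligo: str, primer: str) -> bool:
    """True if the 5' capture site is complementary to the 5' end of the primer."""
    capture_site = capture_oligo[:TAG_LENGTH]
    return reverse_complement(capture_site) == primer[:TAG_LENGTH]


def duplex_region_matches(capture_oligo: str, universal: str) -> bool:
    """True if the universal oligo pairs with everything except the capture site."""
    return reverse_complement(capture_oligo[TAG_LENGTH:]) == universal
```

With the sequences above, `binds_tag(CAP_TAG1, TAG1)` holds while `binds_tag(CAP_TAG2, TAG1)` does not, which is the sequence-level basis for the sorting of the two primers onto different capture oligonucleotides.
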
[0466] PROBE products of one individual (Fig. 83) show a small peak with a molecular mass of 7282.8 Da. This
corresponds to the unextended β-TAG1 that has a calculated mass of 7287.8 Da. The peak at 8498.6 Da corresponds
to a product that has been extended by 4 bases. This corresponds to the wildtype situation. The calculated mass of
this product is 8500.6 Da. There is no significant peak indicating a heterozygote situation. Furthermore, only β-TAG1
and not β-TAG2 has been captured, indicating a high specificity of this method.

[0467] Analysis of what was bound to cap-tag2 (Figure 84) shows only one predominant peak with a molecular
mass of 9331.5 Da. This corresponds to an extension of 8 nucleotides. It indicates a homozygous wildtype situation
where the calculated mass of the expected product is 9355 Da. There is no significant amount of unextended primer
and only β-TAG2 has been captured.

[0468] To prove that this approach is also suitable for capturing specific sequencing products, the same two primers,
β-TAG1 and β-TAG2, were used. The primers were mixed, used in one sequencing reaction and then
sorted by applying the method explained above. Two different termination reactions using ddATP and ddCTP were
performed with these primers (Figures 85 and 86, respectively). All observed peaks in the spectrograms correspond
to the calculated masses in a wildtype situation.

[0469] As shown above, parallel analysis of different mutations (e.g., different PROBE primers) is now possible.
Further, the described method is suitable for capturing specific sequencing products. Capturing can be used for
separation of different sequencing primers out of one reaction tube/well, isolation of specific multiplex-amplified products,
PROBE products, etc. Conventional methods, like cycle sequencing, and conventional volumes can be used. A
universal chip design permits the use of many different applications. Further, this method can be automated for high
throughput.

EXAMPLE 23

Deletion Detection by Mass-Spectrometry 

[0470] Various formats can be employed for mass spectrometric detection of a deletion within a gene. For example, the
molecular mass of a double stranded amplified product can be determined, or either or both of the strands of a double
stranded product can be isolated and the mass measured as described in previous examples.
[0471] Alternatively, as described herein, a specific enzymatic reaction can be performed and the mass of the
corresponding product can be determined by mass spectrometry. The deletion size can be up to several tens of bases in
length, still allowing the simultaneous detection of the wildtype and mutated allele. By simultaneous detection of the
specific products, it is possible to identify in a single reaction whether the individual is homozygous or heterozygous
for a specific allele or mutation.

MATERIALS AND METHODS 

Genomic DNA

[0472] Leukocyte genomic DNA was obtained from unrelated healthy individuals.

PCR amplification

[0473] PCR amplification of the target DNA was established and optimized to use the reaction products without a 
further purification step for capturing with streptavidin coated beads. The primers for target amplification and for PROBE 
reactions were as follows: 

[0474] CKRA-F: d(CAG CTC TCA TTT TCC ATA C SEQ ID. NO. 73) and CKRA-R bio: d(AGC CCC AAG ATG ACT
ATC SEQ ID. NO. 74). CKR-5 was amplified by the following program: 2 min @ 94°C, 45 seconds @ 52°C, 5 seconds
@ 72°C, and a final extension of 5 minutes at 72°C. The final volume was 50 µl including 200 ng genomic DNA, 1 U
Taq-polymerase (Boehringer-Mannheim, Cat # 1596594), 1.5 mM MgCl2, 0.2 mM dNTPs (Boehringer-Mannheim, Cat
# 1277049), 10 pmol of unmodified forward primer, and 8 pmol 5' biotinylated reverse primer.

Capturing and Denaturation of Biotinylated Templates

[0475] 10 µl paramagnetic beads coated with streptavidin (10 mg/ml; Dynal, Dynabeads M-280 streptavidin Cat #
112.06) in 5x binding solution (5 M NH4Cl, 0.3 M NH4OH) were added to 45 µl PCR reaction (5 µl of PCR reaction were
saved for electrophoresis). After binding by incubation for 30 min at 37°C, the supernatant was discarded. Captured
templates were denatured with 50 µl of 100 mM NaOH for 5 min at ambient temperature, washed once with 50 µl of 50
mM NH4OH and three times with 100 µl of 10 mM Tris/Cl, pH 8.0. The single stranded DNA served as template for
PROBE reactions.

Primer Oligo Base Extension (PROBE) Reaction 

[0476] The PROBE reaction was performed using Sequenase 2.0 (USB Cat# E70775Z including buffer). dATP/dGTP
and ddTTP were supplied by Boehringer-Mannheim (Cat # 1277049 and 1008382). d(CAG CTC TCA TTT TCC ATA
C) (SEQ ID. NO. 73) was used as PROBE primer (Figure 87). The following solutions were added to the beads: 3.0 µl
H2O, 1.0 µl reaction buffer, 1.0 µl PROBE primer (10 pmol) and incubated at 65°C for 5 minutes followed by 37°C for
10 min. Then 0.5 µl DTT, 3.5 µl dNTPs/ddNTP (each 50 µM) and 0.5 µl Sequenase (0.8 U) were added and incubated
at 37°C for 10 min.

T4 Treatment of DNA

[0477] To generate blunt ended DNA, amplification products were treated with T4 DNA polymerase
(Boehringer-Mannheim Cat# 1004786). The reactions were carried out according to the manufacturer's protocol for 20 min at 11°C.

Direct Size Determination of Extended Products

[0478] To determine the size of the amplified product, MALDI-TOF was applied to one strand of the amplification
product. Samples were bound to beads, as described above, and conditioned and denatured, as described below.

DNA Conditioning

[0479] After the PROBE reaction the supernatant was discarded and the beads were washed first in 50 µl 700 mM
NH4-citrate and second in 50 µl 50 mM NH4-citrate. The generated diagnostic products were removed from the template
by heating the beads in 2 µl H2O at 80°C for 2 min. The supernatant was used for MALDI-TOF analysis.

Sample Preparation and Analysis with MALDI-TOF Mass Spectrometry

[0480] Sample preparation was performed by mixing 0.6 µl of matrix solution (0.7 M 3-hydroxypicolinic acid, 0.07 M
dibasic citrate in 1:1 H2O:CH3CN) with 0.3 µl of diagnostic PROBE products in water on a sample target and allowing
the mixture to air dry. Up to 100 samples were spotted on a probe target disk for introduction into the source region of an unmodified
PerSeptive Voyager MALDI-TOF instrument operated in linear mode with delayed extraction and 5 and 30 kV on the
target and conversion dynode, respectively. Theoretical average molecular masses (Mr(calc)) of analytes were calculated
from atomic compositions; reported experimental Mr (Mr(exp)) values are those of the singly-protonated form, determined
using internal calibration with unextended primers in the case of PROBE reactions.

Conventional Analyses

[0481] Conventional analyses were performed by native polyacrylamide gel electrophoresis according to standard
protocols. The diagnostic products were denatured with formamide prior to loading onto the gels and stained with
ethidium bromide or silver, respectively.

RESULTS 

[0482] The CKR-5 status of 10 randomly chosen DNA samples of healthy individuals was analyzed. Leukocyte DNA
was amplified by PCR and an aliquot of the amplified product was analyzed by standard polyacrylamide gel
electrophoresis and silver staining of the DNA (Figure 88). Four samples showed two bands, presumably indicating
heterozygosity for CKR-5, whereas the other 6 samples showed one band, corresponding to a homozygous gene (Figure
88). In the case where two bands were observed, they correspond to the expected size of 75 bp for the wildtype gene
and 43 bp for the allele with the deletion (Figure 87). Where one band was observed, the size was about 75 bp, which
indicated a homozygous wildtype CKR-5 allele. One DNA sample derived from a presumably heterozygous individual and one from
a homozygous individual were used for all further analysis. To determine the molecular mass of the amplified product,
DNA was subjected to matrix assisted laser desorption/ionization coupled with time of flight analysis (MALDI-TOF).
Double stranded DNA, bound to streptavidin coated paramagnetic particles, was denatured and the strand released
into the supernatant was analyzed. Figure 89A shows a spectrograph of a DNA sample that was supposed to be
heterozygous according to the result derived by polyacrylamide gel electrophoresis (Figure 88). The calculated mass
of the sense strand for a wildtype gene is 23036 Da and for the sense strand carrying the deletion allele 13143 Da (Figure
87 and Table VI). Since many thermostable polymerases unspecifically add an adenosine to the 3' end of the product,
those masses were also calculated. They are 23349 and 13456 Da. One of the observed peaks (Figure 89A)
has a mass of 23319 Da, which corresponds to the calculated mass of a wildtype DNA strand where an adenosine has been
added (23349 Da). Since no peak with a mass of about 23036 Da was observed, the polymerase must have quantitatively
added adenosine. Two peaks, which are close to each other, have masses of 13451 and 13137 Da. These correspond
to the calculated masses of the allele with the 32 bp deletion. The higher mass peak corresponds to the product where
adenosine has been added and the lower mass peak to the one without the unspecific adenosine. Both peaks have
about the same height, indicating that adenosine has been added to about half of the product. The peak with a mass
of 11682 Da is a doubly charged molecule of the DNA corresponding to 23319 Da (2 x 11682 Da = 23364 Da). The
peaks with masses of 6732 and 6575 Da are doubly charged molecules of the ones with masses of 13451 and 13137
Da, and the peak with 7794 Da corresponds to the triply charged molecule of 23319 Da. Multiply charged molecules
are routinely identified by calculation. Amplified DNA derived from a homozygous individual shows in the spectrograph
(Figure 89C) one peak with a mass of 23349.6 Da and a much smaller peak with a mass of 23039.9 Da. The higher mass
peak corresponds to DNA resulting from a wildtype allele with an added adenosine, which has a calculated mass of
23349 Da. The lower mass peak corresponds to the same product without adenosine. Three further peaks with masses
of 11686, 7804.6 and 5852.5 Da correspond to doubly, triply and quadruply charged molecules.
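
The identification of multiply charged molecules "by calculation" can be illustrated with a short sketch. It assumes that the reported parent masses are those of the singly protonated form, that each additional charge comes from one additional proton, and that a peak is accepted when it lies within 0.5% of the expected m/z; the tolerance, the maximum charge and all names are assumptions made for illustration only.

```python
# Illustrative charge-state assignment; tolerance and names are assumptions.
PROTON_MASS = 1.00728  # Da


def expected_mz(singly_protonated_mass, charge):
    """m/z expected for the ion carrying `charge` protons.

    The input is the mass reported for the singly protonated form.
    """
    neutral_mass = singly_protonated_mass - PROTON_MASS
    return (neutral_mass + charge * PROTON_MASS) / charge


def assign_charge(peak, parent_masses, max_charge=4, rel_tol=0.005):
    """Return (parent_mass, charge) that best explains `peak`, or None."""
    best = None
    for parent in parent_masses:
        for charge in range(1, max_charge + 1):
            target = expected_mz(parent, charge)
            error = abs(peak - target) / target
            if error <= rel_tol and (best is None or error < best[2]):
                best = (parent, charge, error)
    return None if best is None else (best[0], best[1])
```

Applied to the homozygous sample, the parent peak at 23349.6 Da accounts for the peaks at 11686, 7804.6 and 5852.5 Da as the doubly, triply and quadruply charged species, e.g. `assign_charge(7804.6, [23349.6])` gives `(23349.6, 3)`.
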

[0483] The unspecifically added adenine can be removed from the amplified DNA by treatment of the DNA with T4 DNA
polymerase. DNA derived from a heterozygous and a homozygous individual was analyzed after T4 DNA polymerase
treatment. Figure 89B shows the spectrograph derived from heterozygous DNA. The peak corresponding to the wildtype
strand has a mass of 23008 Da, indicating that the added adenine had been removed completely. The same is observed
for the strand with a mass of 13140 Da.
The other three peaks are multiply charged molecules of the parent peaks. The mass spectrograph for the homozygous
DNA shows one peak that has a mass of 23004 Da, corresponding to the wildtype DNA strand without an extra adenine
added. All other peaks are derived from multiply charged molecules of this DNA. The amplified products can be analyzed
by direct determination of their masses, as described above, or by measuring the masses of products that are derived
from the amplified product in a further reaction. In this "primer oligo base extension (PROBE)" reaction, a primer that
can be internal, as it is in nested PCR, or identical to one of the PCR primers, is extended for just a few bases
before the termination nucleotide is incorporated. Depending on the extension length, the genotype can be specified.
CKRA-F was used as a PROBE primer, and dATP/dGTP and ddTTP as nucleotides. The primer extension is AGT in
case of a wildtype template and AT in case of the deletion (Figure 87). The corresponding masses are 6604 Da for the
wildtype and 6275 Da for the deletion, respectively. PROBE was applied to two standard DNAs. The spectrograph
(Figure 90A) shows peaks with masses of 6604 Da corresponding to the wildtype DNA and at 6275 Da corresponding
to the CKR-5 deletion allele (Table VIII). The peak at a mass of 5673 Da corresponds to CKRA-F (calculated mass of
5674 Da). A further sample was analyzed in an analogous way (Figure 90B). It is unambiguously identified as homozygous
DNA, since the peak with a mass of 6607 Da corresponds to the wildtype allele and the peak with a mass of 5677 Da
to the unextended primer. No further peaks were observed.
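
The calculated masses quoted for the PROBE primer and its two extension products can be reproduced from the base composition. The sketch below is illustrative only: it uses commonly tabulated average masses of the nucleotide residues within a DNA chain, a terminal correction for an oligonucleotide with free 5'- and 3'-hydroxyl ends, and a correction of one oxygen atom for the 3'-terminal dideoxynucleotide. These constants and all names are assumptions of the sketch, not values stated in the text.

```python
# Illustrative average-mass calculation for PROBE products; names are hypothetical.
RESIDUE_MASS = {"A": 313.21, "C": 289.18, "G": 329.21, "T": 304.20}  # Da
TERMINAL_CORRECTION = -61.96  # free 5'-OH and 3'-OH ends: + H2O - HPO3
DIDEOXY_CORRECTION = -16.00   # a 3'-terminal ddNMP lacks the 3'-hydroxyl oxygen

PROBE_PRIMER = "CAGCTCTCATTTTCCATAC"  # SEQ ID NO. 73, written without spaces


def oligo_mass(sequence, dideoxy_terminated=False):
    """Average mass (Da) of a single-stranded DNA oligonucleotide."""
    mass = sum(RESIDUE_MASS[base] for base in sequence) + TERMINAL_CORRECTION
    if dideoxy_terminated:
        mass += DIDEOXY_CORRECTION
    return mass


def probe_product_mass(primer, extension):
    """Mass of the primer extended by `extension`, whose last base is a ddNMP."""
    return oligo_mass(primer + extension, dideoxy_terminated=True)


unextended = oligo_mass(PROBE_PRIMER)               # about 5674 Da
wildtype = probe_product_mass(PROBE_PRIMER, "AGT")  # about 6604 Da
deletion = probe_product_mass(PROBE_PRIMER, "AT")   # about 6275 Da
```

Under these assumptions the two diagnostic products differ by the mass of a single dG residue (about 329 Da), which is what allows the wildtype and the deletion allele to be distinguished in one spectrum.
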

[0484] The example demonstrates that deletion analysis can be performed by mass spectrometry. As shown herein,
the deletion can be analyzed by direct detection of single stranded amplified products, or by analysis of specifically
generated diagnostic products (PROBE). In addition, as shown in the following Example 26, double stranded DNA
amplified products can be analyzed.


Size               Calculated Mass    Measured Mass
wildtype w/o A     23036              23039/23009/23004
wildtype with A    23349              23319/23350
deletion w/o A     13143              13137/13139
deletion with A    13456              13451
PROBE
wildtype           6604               6604/6608
deletion           6275               6275

All masses are in Dalton.

EXAMPLE 24


Pentaplex tc-PROBE 


SUMMARY


[0485] The multiplexing of thermocycling primer oligo base extension (tc-PROBE) was performed using five
polymorphic sites in three different apolipoprotein genes, which are thought to be involved in the pathogenesis of
atherosclerosis. The apolipoprotein A IV gene (codons 347 and 360), the apolipoprotein E gene (codons 112 and 158), and
the apolipoprotein B gene (codon 3500) were examined. All mass spectra were easy to interpret with respect to the
five polymorphic sites.


MATERIALS AND METHODS

PCR Amplification


[0486] Human leukocytic genomic DNA was used for PCR. Listed below are the primers used for the separate
amplification of portions of the Apo A IV, Apo E and the Apo B genes:

Apo A IV:  A347F:         5'-CGA GGA GCT CAA GGC GAG AAT-3' (SEQ ID NO. 75)
           A360 R-2-bio: *5'-CAG GGG CAG CTC AGC TCT C-3' (SEQ ID NO. 76)
Apo E:     ApoE-F:        5'-GGC ACG GCT GTC CAA GGA-3' (SEQ ID NO. 77)
           ApoE-R bio:   *5'-AGG CCG CGC TCG GCG CCC TC-3' (SEQ ID NO. 78)
Apo B:     ApoB-F2 bio:  *5'-CTT ACT TGA ATT CCA AGA GC-3' (SEQ ID NO. 79)
           Apo B-R:       5'-GGG CTG ACT TGC ATG GAC CGG A-3' (SEQ ID NO. 80)

* biotinylated


[0487] Taq polymerase and 10x buffer were purchased from Boehringer-Mannheim (Germany) and dNTPs from
Pharmacia (Freiburg, Germany). The total PCR reaction volume was 50 µl including 10 pmol of each primer and 10% DMSO
(dimethylsulfoxide, Sigma) (no DMSO for the PCR of the Apo B gene), with ~200 ng of genomic DNA used as template
and a final dNTP concentration of 200 µM. Solutions were heated to 80°C before the addition of 1 U Taq polymerase;
PCR conditions were: 5 min at 95°C, followed by 2 cycles of 30 sec at 94°C, 30 sec at 62°C, 30 sec at 72°C, 2 cycles of 30 sec
at 94°C, 30 sec at 58°C, 30 sec at 72°C, 35 cycles of 30 sec at 94°C, 30 sec at 56°C, 30 sec at 72°C, and a final extension
time of 2 min at 72°C. To remove unincorporated primers and nucleotides, amplified products were purified using the
"QIAquick" (Qiagen, Germany) kit, with elution of the purified products in 50 µL of TE buffer (10 mM Tris-HCl, 1 mM
EDTA, pH 8.0).

Binding of the amplified product on beads

[0488] 10 µl of each purified amplified product was bound to 5 µl DynaBeads (Dynal, M-280 Streptavidin) and
denatured according to the protocol from Dynal. For the pentaplex tc-PROBE reaction the three different amplified products
(bound on the beads) were pooled.

Tc-PROBE 

[0489] For the PROBE reaction the following primers were used:
(Apo A) P347: 5'-AGC CAG GAC AAG-3' (SEQ ID NO. 81)
(Apo A) P360: 5'-ACA GCA GGA ACA GCA-3' (SEQ ID NO. 82)
(Apo E) P112: 5'-GCG GAC ATG GAG GAC GTG-3' (SEQ ID NO. 83)
(Apo E) P158: 5'-GAT GCC GAT GAC CTG CAG AAG-3' (SEQ ID NO. 84)
(Apo B) P3500: 5'-GTG CCC TGC AGC TTC ACT GAA GAC-3' (SEQ ID NO. 85)

[0490] The tc-PROBE was carried out in a final volume of 25 µl containing 10 pmol of each primer listed above, 2.5
U Thermosequenase (Amersham), 2.5 µL Thermosequenase buffer, and 50 µM dTTP (final concentration) and 200 µM
of ddA/C/GTP, respectively. Tubes containing the mixture were placed in a thermocycler and subjected to the following
cycling conditions: denaturation (94°C) the supernatant was carefully removed from the beads and 'desalted' by ethanol
precipitation to exchange nonvolatile cations such as Na+ and K+ with NH4+, which evaporated during the ionization
process; 5 µL 3 M ammonium acetate (pH 6.5), 0.5 µL glycogen (10 mg/mL, Sigma), 25 µL H2O, and 110 µL absolute
ethanol were added to 25 µL PROBE supernatant and incubated for 1 hour at 4°C. After a 10 min centrifugation at
13,000 x g, the pellet was washed in 70% ethanol and resuspended in 1 µL 18 Mohm/cm H2O. A 0.35 µL aliquot of
resuspended DNA was mixed with 0.35 µL matrix solution (0.7 M 3-hydroxypicolinic acid (3-HPA), 0.07 M ammonium
citrate in 1:1 H2O:CH3CN) on a stainless steel sample target disk and allowed to air dry preceding spectrum acquisition
using the Thermo Bioanalysis Vision 2000 MALDI-TOF operated in reflectron mode with 5 and 20 kV on the target
and conversion dynode, respectively. Theoretical average molecular masses (Mr(calc)) of the fragments were
calculated from atomic compositions. External calibration generated from synthetic (ATCG)n oligonucleotide (3.6-18 kDa)
was used. Positive ion spectra from 1-37500 Da were collected.

RESULTS 

[0491] Table VIII shows the calculated molecular masses of all possible extension products including the mass of
the primer itself. Fig. 91 shows a representative MALDI-TOF MS spectrum of a tc-PROBE using three different templates
and 5 different PROBE primers simultaneously in one reaction. Comparison of the observed and calculated masses
(see Table VIII) allows a fast genetic profiling of various polymorphic sites in an individual DNA sample. The sample
presented in Figure 91 is homozygous for threonine and glutamine at positions 347 and 360, respectively, in the
apolipoprotein A IV gene, bears the epsilon 3 allele homozygous in the apolipoprotein E gene, and is also homozygous at
codon 3500 for arginine in the apolipoprotein B gene.


TABLE VIII

Sequence (site)                              SEQ ID   mass      allele

Apolipoprotein A IV
5'-AGCCAGGACAAG-3' (347)                     86       3688.40   unextended primer
5'-AGCCAGGACAAGTC-3'                         87       4265.80   347Ser
5'-AGCCAGGACAAGA-3'                          88       3985.60   347Thr
5'-ACAGCAGGAACAGCA-3' (360)                  89       4604.00   unextended primer
5'-ACAGCAGGAACAGCATC-3'                      90       5181.40   360His
5'-ACAGCAGGAACAGCAG-3'                       91       4917.20   360Gln

Apolipoprotein E
5'-GCGGACATGGAGGACGTG-3' (112)               92       5629.60   unextended primer
5'-GCGGACATGGAGGACGTGTG-3'                   93       6247.00   112Cys
5'-GCGGACATGGAGGACGTGC-3'                    94       5902.80   112Arg
5'-GATGCCGATGACCTGCAGAAG-3' (158)            95       6480.20   unextended primer
5'-GATGCCGATGACCTGCAGAAGC-3'                 96       6753.40   158Arg
5'-GATGCCGATGACCTGCAGAAGTG-3'                97       7097.60   158Cys

Apolipoprotein B-100
5'-GTGCCCTGCAGCTTCACTGAAGAC-3' (3500)        98       7313.80   unextended primer
5'-GTGCCCTGCAGCTTCACTGAAGACTG-3'             99       7931.20   3500Gln
5'-GTGCCCTGCAGCTTCACTGAAGACC-3'              100      7587.00   3500Arg
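
The comparison of observed and calculated masses described for Table VIII amounts to a nearest-mass lookup. The following sketch shows one way such a lookup could be written; the 5 Da matching tolerance, the site labels and the function names are assumptions made for illustration, and the example input consists of the calculated table values for the genotype described for Figure 91, not of measured data.

```python
# Illustrative genotype lookup against Table VIII; tolerance and names are assumptions.
TABLE_VIII = {
    "ApoA-IV 347": {"unextended": 3688.40, "Ser": 4265.80, "Thr": 3985.60},
    "ApoA-IV 360": {"unextended": 4604.00, "His": 5181.40, "Gln": 4917.20},
    "ApoE 112": {"unextended": 5629.60, "Cys": 6247.00, "Arg": 5902.80},
    "ApoE 158": {"unextended": 6480.20, "Arg": 6753.40, "Cys": 7097.60},
    "ApoB 3500": {"unextended": 7313.80, "Gln": 7931.20, "Arg": 7587.00},
}


def assign_peak(mass, tolerance=5.0):
    """Return (site, label) of the table entry closest to `mass`, or None."""
    best = None
    for site, entries in TABLE_VIII.items():
        for label, calculated in entries.items():
            difference = abs(mass - calculated)
            if difference <= tolerance and (best is None or difference < best[0]):
                best = (difference, site, label)
    return None if best is None else (best[1], best[2])


def genotype(peaks, tolerance=5.0):
    """Collect the alleles found at each site; unextended primer peaks are ignored."""
    calls = {site: set() for site in TABLE_VIII}
    for mass in peaks:
        hit = assign_peak(mass, tolerance)
        if hit is not None and hit[1] != "unextended":
            calls[hit[0]].add(hit[1])
    return calls


# Calculated masses expected for the genotype described for Figure 91
# (347Thr, 360Gln, 112Cys and 158Arg, i.e. epsilon 3, and 3500Arg).
expected_peaks = [3985.60, 4917.20, 6247.00, 6753.40, 7587.00]
profile = genotype(expected_peaks)
```

A site at which `genotype` returns a single allele is read as homozygous and a site with two alleles as heterozygous; because all table entries are separated by more than 200 Da, a tolerance of a few Dalton does not lead to ambiguous assignments.
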


EXAMPLE 25

Sequencing Exons 5 to 8 of the p53 Gene by MALDI-TOF Mass Spectrometry 

MATERIALS & METHODS 

[0492] Thirty-five cycles of PCR reactions were performed in a 96 well microtiter plate with each well containing a
total volume of 50 µl including 200 ng genomic DNA, 1 unit Taq DNA polymerase, 1.5 mM MgCl2, 0.2 mM dNTPs, 10
pmol of the forward primer and 6 or 8 pmol of the biotinylated reverse primer. The sequences of PCR primers prepared
according to established chemistry (N.D. Sinha, J. Biernat, H. Köster, Tetrahed. Lett. 24:5843-5846 (1983)) are as follows:
exon 5: d(biotin-TATCTGTTCACTTGTGCCC SEQ ID NO. 101) and d(biotin-CAGAGGCCTGGGGACCCTG SEQ ID
NO. 102); exon 6: d(ACGACAGGGCTGGTTGCC SEQ ID NO. 103) and d(biotin-ACTGACAACCACCCTTAAC SEQ
ID NO. 104); exon 7: d(CTGCTTGCCACAGGTCTC SEQ ID NO. 105) and d(biotin-CACAGCAGGCCAGTGTGC SEQ
ID NO. 106); exon 8: d(GGACCTGATTTGCTTACTG SEQ ID NO. 107) and d(biotin-TGAATGTGAGGCATAAGTG SEQ
ID NO. 108).

[0493] To each well of the 96-well microtiter plate containing unpurified amplified product, 0.1 mg of paramagnetic
streptavidin beads (Dynal) in 10 µl of 5x binding solution (5 M NH4OH) was added and incubated at 37°C for 30 min.
Then beads were treated with 0.1 M NaOH at room temperature for 5 min followed by one wash with 50 mM NH4OH
at room temperature for 5 min followed by one wash with 50 mM Tris-HCl.

[0494] Four dideoxy termination reactions were carried out in separate wells of the microtiter plate. A total of 84
reactions (21 primers x 4 reactions/primer) can be performed in a single microtiter plate. To each well containing
immobilized single-stranded template, a total volume of 10 µl reaction mixture was added including 1x reaction buffer, 10
pmol of sequencing primer, 250 mM of dNTPs, 25 mM of one of the ddNTPs, and 1-2 units of Thermosequenase
(Amersham). Sequencing reactions were carried out on a thermal cycler using non-cycling conditions: 80°C, 1 min,
50°C, 1 min, 50°C to 72°C, ramping 0.1°/sec, and 72°C, 5 min. The beads were then washed with 0.7 M ammonium
citrate followed by 0.05 M ammonium citrate. Sequencing products were then removed from beads by heating the
beads to 80°C in 2 µl of 50 mM NH4OH for 2 min. The supernatant was used for MALDI-TOF MS analysis.
[0495] Matrix was prepared as described in Köster et al. (Köster, H. et al., Nature Biotechnol. 14: 1123-1128 (1996)). This
saturated matrix solution was then diluted 1.52 times with pure water before use. 0.3 µl of the diluted matrix solution
was loaded onto the sample
target and allowed to crystallize, followed by addition of 0.3 µl of the aqueous analyte. A PerSeptive Voyager DE mass
spectrometer was used for the experiments, and the samples were typically analyzed in the manual mode. The target
and middle plate were kept at +18.2 kV for 200 nanoseconds after each laser shot and then the target voltage was
raised to +20 kV. The ion guide wire in the flight tube was kept at -2 V. Normally, 250 laser shots were accumulated for
each sample. The original spectrum was acquired under 500 MHz digitizing rate, and the final spectrum was smoothed
by a 455 point average (Savitzky and Golay, (1964) Analytical Chemistry, 36:1627). Default calibration of the mass
spectrometer was used to identify each peak and assign sequences. The theoretical mass values of two sequencing
peaks were used to recalibrate each spectrum (D.P. Little, T.J. Cornish, M.J. O'Donnel, A. Braun, R.J. Cotter, H. Köster,
Anal. Chem., submitted).

RESULTS 

[0496] Alterations of the p53 gene are considered to be a critical step in the development of many human cancers
(Greenblatt, et al., (1994) Cancer Res. 54, 4855-4878; C.C. Harris, (1996) J. Cancer, 73, 261-269; and D. Sidransky
and M. Hollstein, (1996) Annu. Rev. Med., 47, 285-301). Mutations may serve as molecular indicators of clonality or as
early markers of relapse in a patient with a previously identified mutation in a primary tumor (Hainaut, et al., (1997)
Nucleic Acids Res., 25, 151-157). The prognosis of the cancer may differ according to the nature of the p53 mutations
present (M.S. Goh et al., (1995) Cancer Res, 55, 5217-5221). Since the discovery of the p53 gene, more than 6000
different mutations have been detected. Exons 5-8, where most of the mutations cluster, were selected as sequencing targets
(Hainaut et al. (1997) Nucleic Acids Res., 25, 151-7).

[0497] Figure 96 schematically depicts the single tube process for target amplification and sequencing, which was
performed as described in detail in the Materials and Methods. Each of exons 5-8 of the p53 gene was PCR amplified
using flanking primers in the intron region; the downstream primer was biotinylated. Amplifications of different exons
were optimized to use the same cycling profile, and the products were used without further purification. PCR reactions
were performed in a 96 well microtiter plate and the product generated in one well was used as the template for one
sequencing reaction. Streptavidin-coated magnetic beads were added to the same microtiter plate and amplified
products were immobilized. The beads were then treated with NaOH to generate immobilized single-stranded DNA as
sequencing template. The beads were washed extensively with Tris buffer since remaining base would reduce the
activity of the sequencing enzyme.

[0498] A total of 21 primers were selected to sequence exons 5-8 of the p53 gene by primer walking. The 3'-end
nucleotide of each primer is located at a site where no known mutation exists. Four termination reactions were
performed separately for each primer, which resulted in a total of 84 sequencing reactions on the same PCR microtiter
plate. Non-cycling conditions were adopted for sequencing since streptavidin-coated beads do not tolerate the repeated
application of high temperature. Sequencing reactions were designed so that most terminated fragments were under
70 nucleotides, a size range easily accessible by MALDI-TOF MS and yet long enough to sequence through the next
primer binding site. Thermo Sequenase was the enzyme of choice since it could reproducibly generate a high yield of
sequencing products in the desired mass range. After the sequencing reactions, the beads were washed with ammonium
ion buffers to replace all other cations. The sequencing ladders were then removed from the beads by heating in
ammonium hydroxide solution or simply in water.

[0499] A sub-microliter aliquot of each of the 84 sequencing reactions was loaded onto one MS sample holder con- 
taining preloaded matrix. Figure 94 gives an example of sequencing data generated from one primer; four spectra are 
superimposed. 

[0500] All sequencing peaks were well resolved in the mass range needed to read through the next sequencing
primer site. Sometimes doubly charged peaks were observed, which could be easily identified by correlating the mass
to that of the singly charged ion. False stops generated by early termination of the enzymatic extension can be observed
close to the primer site. Since the mass resolution is high enough, it is easy to differentiate the false stop peaks from the
real sequencing peaks by calculating the mass difference of the neighboring peaks and cross comparing the four spectra.
Additionally, most primers generated detectable data through the region of the downstream primer binding site, thereby
covering the false stop region.
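
The charge-state check described above can be sketched in a few lines of Python. This is only a minimal illustration under stated assumptions, not the procedure used in the study: the function name, the 0.1% tolerance and the example peak list are hypothetical, and protonated ions in positive-ion mode are assumed.

```python
PROTON = 1.00728  # mass of a proton in Da

def find_doubly_charged(peaks, tol=0.001):
    """Return (m/z of 2+ peak, m/z of matching 1+ peak) pairs.

    A singly charged ion of neutral mass M appears at M + PROTON and
    its doubly charged counterpart at (M + 2 * PROTON) / 2.
    tol is a relative tolerance (hypothetical default of 0.1%).
    """
    pairs = []
    for single in peaks:
        neutral = single - PROTON
        expected = (neutral + 2 * PROTON) / 2
        for other in peaks:
            if other != single and abs(other - expected) <= tol * expected:
                pairs.append((other, single))
    return pairs

# Hypothetical peak list (m/z), for illustration only.
example_peaks = [5000.0, 6200.5, 10000.0, 12400.0]
doubly = find_doubly_charged(example_peaks)
```

Peaks flagged this way can be set aside before the mass differences between neighboring peaks are read as sequence.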

[0501] Using optimized procedures of amplification, sequencing, and conditioning, exons 5-8 of the p53 gene were
successfully sequenced. Correct wildtype sequence data were obtained from all exons with a mass resolution of about
300 to 800 over the entire mass range. The overall mass accuracy is 0.05% or better. The average amount of each
sequencing fragment loaded on the MS sample holder is estimated to be 50 fmol or less.
[0502] This example demonstrates the feasibility of sequencing exons of a human gene by MALDI-TOF MS. Compared
to gel-based automated fluorescent DNA sequencing, the read lengths are shorter. Microchip technology can be
incorporated to provide for parallel processing. Sequencing products generated in the microtiter plate can be directly
transferred to a microchip which serves as a launching pad for MALDI-TOF MS analysis. Robot-driven serial and
parallel nanoliter dispensing tools are being used to produce 100-1000 element DNA arrays on < 1" square chips with
flat or geometrically altered (e.g., with wells) surfaces for rapid mass spectrometric analysis.

[0503] Figure 94 shows an MS spectrum obtained on a chip where the sample was transferred from a microtiter
plate by a pintool. The estimated amount of each termination product loaded is 5 fmol or less, which is in the range of
amounts used in conventional Sanger sequencing with radiolabeled or fluorescent detection (0.5-1 fmol per fragment).
The low volume MALDI sample deposition has the advantages of miniaturization (reduced reagent costs), enhanced
reproducibility and automated signal acquisition.

EXAMPLE 26 

Direct detection of synthetic and biologically generated double-stranded DNA by MALDI-TOF MS 
Introduction 

[0504] Typically, matrix-assisted laser desorption/ionization (Karas, et al., (1989) Int. J. Mass Spectrom. Ion Processes,
92, 231) time-of-flight mass spectrometry (MALDI-TOF MS) of DNA molecules which are double stranded (ds)
in solution yields molecular ions representative of the two single stranded components (Tang, et al. (1994) Rapid
Commun. Mass Spectrom. 8:183; Tang, et al. (1995) Nucleic Acids Res. 23:3126; Benner, et al. (1995) Rapid Commun.
Mass Spectrom. 9:537; Liu, et al. (1995) Anal. Chem. 67:3482; Siegert et al. (1996) Anal. Biochem. 243:55; and
Doktycz, et al. (1995) Anal. Biochem. 230:205); this has been observed in several reports dealing with biologically generated
DNA from a polymerase chain reaction (PCR) amplification (Tang, et al. (1994) Rapid Commun. Mass Spectrom. 8:
183; Liu, et al. (1995) Anal. Chem. 67:3482; Siegert et al. (1996) Anal. Biochem. 243:55; and Doktycz, et al. (1995)
Anal. Biochem. 230:205). It is not clear whether the double strand is destabilized because of the decreased pH in the
matrix environment or because of absorbance by the duplex during desorption/ionization/acceleration of an energy
sufficient to overcome the attractive van der Waals and "stacking" stabilization forces (Cantor and Schimmel, Biophysical
Chemistry Part I: The Conformation of Biomolecules, W.H. Freeman, New York, (1980), 176). When analyte is present
at high concentrations, formation of non-specific gas-phase DNA multimers is, as with proteins (Karas, et al., (1989)
Int. J. Mass Spectrom. Ion Processes 92:231), common; however, Lecchi and Pannell (Lecchi et al. (1995) J. Am. Soc.
Mass Spectrom. 6:972) have provided strong evidence for specific Watson-Crick (WC) base pairing being maintained
in the gas phase. They detected these specific dimers when using 6-aza-2-thiothymine as a matrix, but did not observe
them with 3-hydroxypicolinic acid (3-HPA) or 2,4,6-trihydroxyacetophenone matrix. As described below, by using a low
acceleration voltage of the ions and preparing samples for MALDI analysis at reduced temperatures, routine detection
of dsDNA is possible.

MATERIALS AND METHODS

[0505] Synthetic DNA. Oligonucleotides were synthesized (Sinha, et al. (1984) Nucleic Acids Res., 12, 4539) on a
PerSeptive Expedite DNA synthesizer and reverse phase HPLC purified in-house. Sequences were: 50-mer (15337
Da): 5'-TTG CGT ACA CAC TGG CCG TCG TTT TAG AAC GTC GTG ACT GGG AAA ACC CT-3' (SEQ ID NO. 109);
27-merc (complementary, 8343 Da): 5'-GTA AAA CGA CGG CCA GTG TGT ACG CAA-3' (SEQ ID NO. 110); 27-mernc
(non-complementary, 8293 Da): 5'-TAC TGG AAG GCG ATC TCA GCA ATC AGC-3' (SEQ ID NO. 111). 100 µM stock
solutions were diluted to 20, 10, 5, and 2.5 µM using 18 Mohm/cm H2O. 2 µL each of equimolar solutions of the 50-mer
and either 27-merc or 27-mernc were mixed and allowed to anneal at room temperature for 10 minutes. 0.5 µL of these
mixtures were mixed directly on a sample target with 1 µL matrix (0.7 M 3-HPA, 0.07 M ammonium citrate in 50%
acetonitrile) and allowed to air dry.
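
Mixing equal volumes of two equimolar solutions halves the concentration of each strand. The short sketch below works through that arithmetic for the dilution series; the function and variable names are illustrative only, and the further dilution by matrix on the sample target is deliberately ignored.

```python
# Concentrations (µM) of the diluted single-strand solutions before mixing.
stock_uM = [20, 10, 5, 2.5]

def final_concentration(stock, v_own=2.0, v_other=2.0):
    """Concentration of one strand after mixing v_own µL of its solution
    with v_other µL of the partner strand's solution."""
    return stock * v_own / (v_own + v_other)

final_uM = [final_concentration(c) for c in stock_uM]
```

Under these assumptions each oligonucleotide ends up at 10, 5, 2.5 or 1.25 µM in the annealing mixture.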

Biological DNA. Enzymatic digestion of human genomic DNA from leukocytes was performed. PCR primers (forward,
5'-GGC ACG GCT GTC CAA GGA G-3' (SEQ ID NO. 112); reverse, 5'-AGG CCG CGC TCG GCG CCC TC-3' (SEQ
ID NO. 113)) to amplify a portion of exon 4 of the apolipoprotein E gene were delineated from the published sequence
(Das et al., (1985) J. Biol. Chem., 260, 6240). Taq polymerase and 10x buffer were purchased from Boehringer-Mannheim
(Germany) and dNTPs from Pharmacia (Freiburg, Germany). The total reaction volume was 50 µl including 20
pmol of each primer and 10% DMSO (dimethylsulfoxide, Sigma) with approximately 200 ng of genomic DNA used as
template. Solutions were heated to 80°C before the addition of 1 U polymerase; PCR conditions were: 2 min at 94°C,
followed by 40 cycles of 30 sec at 94°C, 45 sec at 63°C, 30 sec at 72°C, and a final extension time of 2 min at 72°C.
While no quantitative data were collected to determine the final yield of amplified product, it is estimated that ~2 pmol
were available for the enzymatic digestion.
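
As a rough check on the cycling profile, the total programmed hold time can be added up as follows. This is a sketch only: the variable names are illustrative, and ramp times between temperatures, which depend on the thermocycler, are not included.

```python
# Programmed hold times in seconds, taken from the PCR conditions above.
initial_denaturation_s = 2 * 60   # 2 min at 94 °C
cycle_steps_s = [30, 45, 30]      # 94 °C, 63 °C, 72 °C
n_cycles = 40
final_extension_s = 2 * 60        # 2 min at 72 °C

total_s = initial_denaturation_s + n_cycles * sum(cycle_steps_s) + final_extension_s
total_min = total_s / 60
```

The programmed holds add up to 74 minutes, so the actual run time is somewhat longer once ramping is included.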

[0506] CfoI and RsaI and reaction buffer L were purchased from Boehringer-Mannheim. 20 µl of amplified products
were diluted with 15 µl water and 4 µl buffer L; after addition of 10 units of restriction enzymes the samples were incubated
for 60 min at 37°C. For precipitation of digest products, 5 µl of 3 M ammonium acetate (pH 6.5), 0.5 µl glycogen (Braun,
et al. (1997) Clin. Chem. 43:1151) (10 mg/ml, Sigma), and 110 µl absolute ethanol were added to 50 µL of the analyte
solutions and stored for 1 hour at room temperature. After a 10 min centrifugation at 13,000 x g, the pellet was washed
in 70% ethanol and resuspended in 1 µl 18 Mohm/cm H2O.

[0507] Sample preparation and analysis by MALDI-TOF MS. 0.35 µl of resuspended DNA was mixed with 0.35-1.3
µL matrix solution (0.7 M 3-hydroxypicolinic acid (3-HPA), 0.07 M ammonium citrate in 1:1 H2O:CH3CN) (Wu, et al.
(1993) Rapid Commun. Mass Spectrom. 7:142) on a stainless steel sample target disk and allowed to air dry preceding
spectrum acquisition using a Thermo Bioanalysis Vision 2000 MALDI-TOF instrument operated in positive ion reflectron
mode with 5 and 20 kV on the target and conversion dynode, respectively. Theoretical average molecular masses
(Mr(calc)) of the fragments were calculated from atomic compositions; the mass of a proton (1.08 Da) was subtracted from
raw data values in reporting experimental molecular masses (Mr(exp)) as neutral basis. External calibration generated
from eight peaks (2000-18000 Da) was used for all spectra.

Results and Discussion 

[0508] Figure 96A is a MALDI-TOF mass spectrum of a mixture of the synthetic 50-mer with (non-complementary)
27-mernc (each 10 µM, the highest final concentration used in this study); the laser power was adjusted to just above
the threshold irradiation for ionization. The peaks at 8.30 and 15.34 kDa represent singly charged ions derived from
the 27- and 50-mer single strands, respectively. Poorly resolved low intensity signals at ~16.6 and ~30.7 kDa represent
homodimers of the 27- and 50-mer, respectively; that at 23.6 kDa is consistent with a heterodimer containing one 27-mer
and one 50-mer strand. Thus low intensity dimer ions representing all possible combinations from the two
non-complementary oligonucleotides (27+27; 27+50; 50+50) were observed. Increasing the irradiance even to a point where
depurination peaks dominated the spectrum resulted in slightly higher intensities of these dimer peaks. Note that the
hybridization was performed at room temperature and with a very low salt concentration, conditions at which
non-specific hybridization may occur.

[0509] Figure 96 shows a MALDI-TOF spectrum of the same 50-mer mixed with (complementary) 27-merc; the final
concentration of each oligonucleotide was again 10 µM. Using the same laser power as in Figure 96A, intense signals
were again observed at 8.34 and 15.34 kDa, consistent with single stranded 27- and 50-mer, respectively. Homodimer
peaks (27 + 27; 50 + 50) were barely apparent in the noise; however, singly (23.68 kDa) and doubly (11.84 kDa) charged
heterodimer (27 + 50) peaks were dominant. Although the 23.68 kDa dimer peak could be detected from all irradiated
positions, its intensity relative to the monomer peaks varied slightly from spot to spot. Repeating the experiment with
individual oligonucleotide concentrations of 5, 2.5, and 1.25 µM resulted in decreasing amounts of the 27-/50-mer
Watson-Crick dimer peak relative to the 27- and 50-mer single stranded peaks. At the lowest concentrations, the
observation of dimer was "crystal-dependent", that is, irradiation of some crystals produced significant 27-/50-mer dimer
signal, while other crystals reproducibly yielded very little or none. This indicates that the incorporation of dsDNA into
the matrix crystals or the effectiveness of retaining this interaction through the ionization/desorption process is
dependent upon the microscopic properties of the crystals, and/or that there exist steep concentration gradients of the duplex
throughout the sample.

[0510] Thus the Figure 96 spectra provide strong evidence that specific WC base paired dsDNA can be observed
using gentle laser conditions with high concentrations of oligonucleotides in this mass range, the first report of this
using a 3-HPA matrix. The study was extended to a complex mixture of dsDNA derived from an enzymatic digest
(RsaI/CfoI) of a region of exon 4 of the apolipoprotein E gene (Das et al., (1985) J. Biol. Chem., 260, 6240); expected fragment
masses are given in Table IX.


Table IX

CfoI/RsaI Digestion Products from ApoE gene exon 4 (a)

    bases (b)        ssDNA (Da)        dsDNA (Da)
    (+)    (-)       (+)      (-)
    11     13        3428     4025      7453
    16     16        5004     4924      9928
    18     18        5412     5750     11162
    17     19        5283     5880     11163
    19     19        5999     5781     11780
    24     22        7510     6745     14225
    31     29        9628     9185     18813
    36     38       11279    11627     22906
    48     48       14845    14858     29703
    55     53       17175    16240     33415

(a) e3 allele has no 17/19 or 19/19 pairs; e4 allele contains no 36/38 pair.
(b) (+) sense strand, (-) antisense strand.


After the digestion step, the samples were purified and concentrated by ethanol precipitation and resuspended in 1 µL
H2O before mixing them at room temperature with matrix on the sample target. Nearly 20 peaks ranging in mass from
3.4-17.2 kDa were resolved in the products' MALDI spectrum (Figure 97A), all consistent with denatured single stranded
components of the double strand (Table IX). Many such analyses of similar biological products over a period of months
also yielded spectra with negligible dsDNA, consistent with previous reports (Tang, et al. (1994) Rapid Commun. Mass
Spectrom. 8:183; Liu, et al. (1995) Anal. Chem. 67:3482; Siegert et al. (1996) Anal. Biochem. 243:55; and Doktycz, et
al. (1995) Anal. Biochem. 230:205); in contrast, intact double strands were observed under similar conditions for the
synthetic DNA (Figure 96A). It is difficult to estimate the strand concentration available after the biological reactions,
but presumably it was far lower than that at which dimerization of synthetic samples occurred. Furthermore,
maintaining specific hybrids within the two-component synthetic mixture may be kinetically favored relative to the far more
complex mixture of 20 single-stranded DNA components from the digest.

[0511] The effect of reduced temperature on maintaining dsDNA was tested. An aliquot of the digested DNA solution,
the matrix, pipette, pipette tips, and the stainless steel sample target were stored in a 4°C "cold room" for 15 minutes;
as with normal preparations, matrix and then analyte were spotted on the target and allowed to co-crystallize while
air drying. Crystallization for mixtures of 300 nL 3-HPA (50% acetonitrile) with 300 nL analyte required ~1 minute at
room temperature but ~15 minutes at the reduced temperature. Sample spots prepared in the cold room environment
typically contained a high proportion of large transparent crystals.

[0512] MALDI-TOF analysis of an ApoE digest aliquot prepared at reduced temperature produced the Figure 97B
spectrum. While the low mass range appeared qualitatively similar to Figure 97A, dramatic differences above 8 kDa
were observed. Only signals consistent with single strands (Table IX) were observed in Figure 97A, but the Figure 97B
cold room prepared samples did not yield signals for the same masses except below 8 kDa. Even more striking were
the additional high mass peaks in Figure 97B; clearly these represent dimer peaks containing lower mass components.
As was done with the synthetic DNA, it was important to determine whether these represent non-specific heterodimers,
specific WC heterodimers, or nonspecific homodimers. Consider first the 33.35 kDa fragment. Ignoring the unlikely
possibility that the high mass fragment represents a trimer or higher multimer, as a dimer it must contain only the
highest mass ssDNA components, i.e., those >16 kDa. Homodimerization of the 16.24 and 17.18 kDa fragments would
result in 32.49 and 34.35 kDa peaks, respectively; corresponding mass errors for these incorrect assignments relative
to the observed 33.35 kDa would be -2.6% and +3.0%, respectively. A far better match is achieved if this peak originates
from a heterodimer of the two highest mass single stranded fragments; their summed mass (16.24 + 17.18 = 33.42 kDa)
differed by 0.2% from the observed dimer mass of 33.35 kDa, an acceptable mass error for MALDI-TOF analysis of large
DNA fragments using external calibration. Likewise, the 29.66 kDa fragment was measured only 0.13% lower than the
29.70 kDa expected for a heterodimer of 48-mers; the sums of no other possible homodimers or heterodimers were within
a reasonable range of this mass. Similar arguments could be made for the 22.89 and 18.83 kDa fragments, representing
36-/38-mer and 31-/29-mer heterodimers, respectively; the signal at 14.86 kDa is consistent with singly charged single
stranded and doubly charged double-stranded 48-mer. The agreement of the Figure 97B masses above 15 kDa with
those of dsDNA expected from this digest and the absence of homodimers and non-specific heterodimers at random
masses indicated that the base pairings were indeed highly specific and provided further evidence that gas-phase WC
interactions may be retained in MALDI-generated ions.
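
The assignment argument for the 33.35 kDa peak can be reproduced with a short script that enumerates every homo- and heterodimer sum and ranks it by relative mass error. This is a minimal sketch rather than the analysis actually performed: the function name and dictionary labels are illustrative, and only the four heaviest single strands from Table IX (masses as printed there) are included for brevity.

```python
from itertools import combinations_with_replacement

# Single-strand masses (Da) of the largest fragments, as printed in Table IX.
ss_masses = {
    "48(+)": 14845, "48(-)": 14858,
    "55(+)": 17175, "53(-)": 16240,
}

def rank_dimers(observed, masses):
    """Rank all homo- and heterodimer sums by absolute relative error (%)."""
    ranked = []
    for a, b in combinations_with_replacement(sorted(masses), 2):
        total = masses[a] + masses[b]
        error_pct = 100 * (observed - total) / total
        ranked.append((abs(error_pct), error_pct, a, b, total))
    ranked.sort()
    return ranked

# Observed high-mass peak from Figure 97B, in Da.
best = rank_dimers(33350, ss_masses)[0]
```

With these inputs the best match is the 55(+)/53(-) heterodimer at 33415 Da, about 0.2% above the observed mass, whereas the two homodimers of these strands are off by several percent.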

[0513] Figure 98 shows a MALDI-TOF spectrum of an e4 allele, which, unlike the e3, was expected to yield no
36-/38-mer pair upon CfoI/RsaI digestion. The e3 and e4 mass spectra were similar except that the abundant 22.89 kDa
fragment in Figure 97B was not present in Figure 98; with this information alone (Table IX) e3 and e4 alleles were
easily distinguished, thereby demonstrating genotyping by direct measurement of dsDNA by MALDI-TOF MS.
Similarly dsDNA could be ionized, transferred to the gas phase, and detected by MALDI-TOF MS. The acceleration
voltage typically employed on our instrument was only ~5 kV, corresponding to 1.5 kV/mm up to ~2 mm from the sample
target, with the electric field strength decreasing rapidly with distance from the sample target. Most previous work used
at least 20 kV acceleration (Lecchi et al. (1995) J. Am. Soc. Mass Spectrom. 6:972); in one exception a 27-mer dsDNA
was detected using a frozen matrix solution and 100 V acceleration (Nelson, et al. (1990) Rapid Commun. Mass
Spectrom. 4:348). Without being bound by any theory, MALDI-induced "denaturation" of dsDNA may be due to gas-phase
collisional activation that disrupts the WC pairing when high acceleration fields are employed, analogous to the
denaturation presumed to be a first step in the fragmentation used for sequencing the single stranded components of dsDNA
using electrospray ionization (McLafferty et al. (1996) Int. J. Mass Spectrom. Ion Processes). It appears that the high
salt concentrations (typically > 10 mM NaCl or KCl) required to stabilize WC paired dsDNA in solution are unsuitable
for MALDI analysis (Nordhoff et al. (1993) Nucleic Acids Res. 21:3347); reducing the concentration of such non-volatile
cations is necessary to avoid cation-adducted MALDI signals, but destabilizes the double strands in solution. The low
pH conditions of the matrix environment should also destabilize the duplex. As shown in Figures 97B and 98, storing
and preparing even low concentrations of the biological samples at reduced temperature at least in part offset these
denaturing effects, especially for longer strands where melting temperatures are higher due to a more extensive
hydrogen bonding network. The conditions used here are recognized to be very non-stringent annealing conditions.

[0514] The low mass tails on high mass dsDNA peaks (e.g., Figure 97B, 232 kDa) are consistent with depurination
generated to a higher extent than the sum of depurination from each of the single strands combined. Although
depurination in solution is an acid-catalyzed reaction, the weakly acidic conditions in the 3-HPA matrix do not induce
significant depurination; molecular ion signals from a mixed-base 50-mer measured with DE-MALDI-TOF had only minor
contributions from depurination peaks (Juhasz, et al. (1996) Anal. Chem. 68:941). Depurination from the single stranded
components of the gas-phase dsDNA is observed even though these bases are expected to be hydrogen bonded to
the complementary base of the accompanying strand, implying that covalent bonds are being broken before the strand
is denatured.

EXAMPLE 27

Efficiency and Specificity Assay for Base-Specific Ribonucleases

[0515] Aliquots sampled at regular time intervals during digestion of selected synthetic 20- to 25-mers were analyzed
by mass spectrometry. Three of the RNAses were found to be efficient and specific. These include: the G-specific T1,
the A-specific U2 and the A/U-specific PhyM. The ribonucleases presumed to be C-specific were found to be less
reliable, e.g., did not cleave at every C or also cleaved at U in an unpredictable manner. The three promising RNAses
all yielded cleavage at all of the predicted positions and complete sequence coverage was obtained. In addition, the
presence of cleavage products containing one or several uncleaved positions (short incubation times) allowed
alignment of the cleavage products. An example of the MALDI spectrum of an aliquot sampled after T1 digest of a synthetic
20-mer [SEQ ID NO:114] RNA is shown in Figure 100.

EXAMPLE 28

Immobilization of amplified DNA targets to silicon wafers
Silicon surface preparation

[0516] Silicon wafers were washed with ethanol, flamed over a Bunsen burner, and immersed in an anhydrous solution
of 25% (by volume) 3-aminopropyltriethoxysilane in toluene for 3 hours. The silane solution was then removed, and
the wafers were washed three times with toluene and three times with dimethyl sulfoxide (DMSO). The wafers were
then incubated in a 10 mM anhydrous solution of N-succinimidyl (4-iodoacetyl) aminobenzoate (SIAB) (Pierce Chemical,
Rockford, IL) in anhydrous DMSO. Following the reaction, the SIAB solution was removed, and the wafers were washed
three times with DMSO. In all cases, the iodoacetamido-functionalized wafers were used immediately to minimize
hydrolysis of the labile iodoacetamido-functionality. Additionally, all further wafer manipulations were performed in the
dark since the iodoacetamido-functionality is light sensitive.

Immobilization of amplified thiol-containing nucleic acids

[0517] The SIAB-conjugated silicon wafers were used to analyze specific free thiol-containing DNA fragments of a
particular amplified DNA target sequence. A 23-mer oligodeoxynucleotide containing a 5'-disulfide linkage [purchased
from Operon Technologies; SEQ ID NO: 117] that is complementary to the 3'-region of a 112 bp human genomic DNA
template [GenBank Acc. No.: Z52259; SEQ ID NO: 118] was used as a primer in conjunction with a commercially
available 49-mer primer, which is complementary to a portion of the 5'-end of the genomic DNA [purchased from Operon
Technologies; SEQ ID NO: 119], in PCR reactions to amplify a 135 bp DNA product containing a 5'-disulfide linkage
attached to only one strand of the DNA duplex [SEQ ID NO: 120].

[0518] The PCR amplification reactions were performed using the AmpliTaq Gold Kit [Perkin Elmer Catalog No.
N808-0249]. Briefly, 200 ng 112 bp human genomic DNA template was incubated with 10 µM of 23-mer primer and
8 µM of commercially available 49-mer primer, 10 mM dNTPs, 1 unit of AmpliTaq Gold DNA polymerase in the buffer
provided by the manufacturer and PCR was performed in a thermocycler.

[0519] The 5'-disulfide bond of the resulting amplified product was fully reduced using 10 mM tris-(2-carboxyethyl)
phosphine (TCEP) (Pierce Chemical, Rockford, IL) to generate a free 5'-thiol group. Disulfide reduction of the modified
oligonucleotide was monitored by observing a shift in retention time on reverse-phase FPLC. It was determined that
after five hours in the presence of 10 mM TCEP, the disulfide was fully reduced to a free thiol. Immediately following
disulfide cleavage, the modified oligonucleotide was incubated with the iodoacetamido-functionalized wafers and
conjugated to the surface of the silicon wafer through the SIAB linker. To ensure complete thiol deprotonation, the coupling
reaction was performed at pH 8.0. Using 10 mM TCEP to cleave the disulfide and the other reaction conditions described
above, it was possible to reproducibly yield a surface density of 250 fmol per square mm of surface.

Hybridization and MALDI-TOF mass spectrometry

[0520] The silicon wafer conjugated with the 135 bp thiol-containing DNA was incubated with a complementary
12-mer oligonucleotide [SEQ ID NO: 121] and specifically hybridized DNA fragments were detected using MALDI-TOF
MS analysis. The mass spectrum revealed a signal with an observed experimental mass-to-charge ratio of 3618.33;
the theoretical mass-to-charge ratio of the 12-mer oligomer sequence is 3622.4 Da.
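
The agreement between the observed and theoretical values can be expressed as a relative error. The sketch below works through that arithmetic; the function name is illustrative, and the two values are compared directly as reported, which assumes a singly charged ion.

```python
def relative_error_pct(observed, theoretical):
    """Signed relative deviation of an observed value from theory, in percent."""
    return 100 * (observed - theoretical) / theoretical

# Values reported above for the hybridized 12-mer.
error_pct = relative_error_pct(3618.33, 3622.4)
```

The observed signal lies about 0.11% below the theoretical value.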

[0521] Thus, specific DNA target molecules that contain a 5'-disulfide linkage can be amplified. The molecules are
immobilized at a high density on a SIAB-derivatized silicon wafer using the methods described herein and specific
complementary oligonucleotides may be hybridized to these target molecules and detected using MALDI-TOF MS
analysis.

EXAMPLE 29

Use of High Density Nucleic Acid Immobilization to Generate Nucleic Acid Arrays

[0522] Employing the high density attachment procedure described in EXAMPLE 28, an array of DNA oligomers
amenable to MALDI-TOF mass spectrometry analysis was created on a silicon wafer having a plurality of locations,
e.g., depressions or patches, on its surface. To generate the array, a free thiol-containing oligonucleotide primer was
immobilized only at the selected locations of the wafer [e.g., see EXAMPLE 28]. Each location of the array contained
one of three different oligomers. To demonstrate that the different immobilized oligomers could be separately detected
and distinguished, three distinct oligonucleotides of differing lengths that are complementary to one of the three
oligomers were hybridized to the array on the wafer and analyzed by MALDI-TOF mass spectrometry.

Oligodeoxynucleotides

[0523] Three sets of complementary oligodeoxynucleotide pairs were synthesized in which one member of the
complementary oligonucleotide pair contains a 3'- or 5'-disulfide linkage [purchased from Operon Technologies or Oligos,
Etc.]. For example, Oligomer 1 [d(CTGATGCGTGGGATCATCTTTTTT-SS); SEQ ID NO: 122] contains a 3'-disulfide
linkage whereas Oligomer 2 [d(SS-CCTGTTGGGAACTGTGTAGTATT); a 5'-disulfide derivative of SEQ ID NO: 117]
and Oligomer 3 [d(SS-GAATTCGAGCTCGGTAGCCGG); a 5'-disulfide derivative of SEQ ID NO: 115] each contain a
5'-disulfide linkage.

[0524] The oligonucleotides complementary to Oligomers 1-3 were designed to be of different lengths that are easily
resolvable from one another during MALDI-TOF MS analysis. For example, a 23-mer oligonucleotide [SEQ ID NO:
123] was synthesized complementary to a portion of Oligomer 1, a 12-mer oligonucleotide [SEQ ID NO: 121] was
synthesized complementary to a portion of Oligomer 2 and a 21-mer [SEQ ID NO: 116] was synthesized complementary
to a portion of Oligomer 3. In addition, a fourth 29-mer oligonucleotide [SEQ ID NO: 124] was synthesized that lacks
complementarity to any of the three oligomers. This fourth oligonucleotide was used as a negative control.

Silicon surface chemistry and DNA immobilization
(a) 4 x 4 (16-location) array

[0525] A 2 x 2 cm² silicon wafer having 256 individual depressions or wells in the form of a 16 x 16 well array was
purchased from a commercial supplier [Accelerator Technology Corp., College Station, Texas]. The wells were 800 x
800 µm², 120 µm deep, on a 1.125 mm pitch. The silicon wafer was reacted with 3-aminopropyltriethoxysilane to produce
a uniform layer of primary amines on the surface and then exposed to the heterobifunctional crosslinker SIAB, resulting
in iodoacetamido functionalities on the surface [e.g., see EXAMPLE 28].

[0526] To prepare the oligomers for coupling to the various locations of the silicon array, the disulfide bond of each
oligomer was fully reduced using 10 mM TCEP as depicted in EXAMPLE 28, and the DNA resuspended at a final
concentration of 10 µM in a solution of 100 mM phosphate buffer, pH 8.0. Immediately following disulfide bond reduction,
the free-thiol group of the oligomer was coupled to the iodoacetamido functionality at 16 locations on the wafer using
the probe coupling conditions essentially as described above in EXAMPLE 28. To accomplish the separate coupling
at 16 distinct locations of the wafer, the entire surface of the wafer was not flushed with an oligonucleotide solution
but, instead, an ~30-nl aliquot of a predetermined modified oligomer was added in parallel to each of 16 locations
(i.e., depressions) of the 256 wells on the wafer to create a 4 x 4 array of immobilized DNA using a robotic pintool.
[0527] The robotic pintool consists of 16 probes housed in a probe block and mounted on an X, Y, Z robotic stage.
The robotic stage is a gantry system which enables the placement of sample trays below the arms of the robot. The
gantry unit itself is composed of X and Y arms which move 250 and 400 mm, respectively, guided by brushless linear
servo motors with positional feedback provided by linear optical encoders. A lead screw driven Z axis (50 mm vertical
travel) is mounted to the XY axis slide of the gantry unit and is controlled by an in-line rotary servo motor with positional
feedback by a motor-mounted rotary optical encoder. The work area of the system is equipped with a slide-out tooling
plate that holds five microtiter plates (most often, 2 plates of wash solution and 3 plates of sample for a maximum of
1152 different oligonucleotide solutions) and up to ten 20 x 20 mm wafers. The wafers are placed precisely in the plate
against two banking pins and held secure by vacuum. The entire system is enclosed in plexi-glass housing for safety
and mounted onto a steel support frame for thermal and vibrational damping. Motion control is accomplished by
employing a commercial motion controller, which is a 3-axis servo controller integrated with a computer;
programming code for specific applications is written as needed.
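
The stated capacity of the tooling plate implies a particular plate format, which the following arithmetic makes explicit. This is a sketch only: the variable names are illustrative, and the number of dips per plate assumes that each dip of the 16-probe tool samples 16 distinct wells, which the text does not state.

```python
sample_plates = 3      # plates of sample on the tooling plate
max_solutions = 1152   # maximum number of different oligonucleotide solutions
probes = 16            # probes in the pintool block

# Wells per sample plate implied by the stated maximum.
wells_per_plate = max_solutions // sample_plates

# Assumption: each dip of the tool samples 16 distinct wells.
dips_per_plate = wells_per_plate // probes
```

The stated maximum corresponds to 384 wells per sample plate, and under the assumption above a full plate would be sampled in 24 dips.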

[0528] To create the DNA array, a pintool with assemblies that have solid pin elements was dipped into 16 wells of
a multi-well DNA source plate containing solutions of Oligomers 1-3 to wet the distal ends of the pins, the robotic
assembly moved the pin assembly to the silicon wafer, and the sample was spotted by surface contact. Thus, one of the
modified Oligomers 1-3 was covalently immobilized to each of 16 separate wells of the 256 wells on the silicon wafer,
thereby creating a 4 x 4 array of immobilized DNA.

[0529] In carrying out the hybridization reaction, the three complementary oligonucleotides and the negative control
oligonucleotide were mixed at a final concentration of 10 µM for each oligonucleotide in 1 ml of TE buffer [10 mM
Tris-HCl, pH 8.0, 1 mM EDTA] supplemented with 1 M NaCl, and the solution was heated at 65°C for 10 min. Immediately
thereafter, the entire surface of the silicon wafer was flushed with 800 µl of the heated oligonucleotide solution. The
complementary oligonucleotides were annealed to the immobilized oligomers by incubating the silicon array at ambient
temperature for 1 hr, followed by incubation at 4°C for at least 10 min. Alternatively, the oligonucleotide solution can
be added to the wafer, which is then heated and allowed to cool for hybridization.

[0530] The hybridized array was then washed with a solution of 50 mM ammonium citrate buffer for cation exchange
to remove sodium and potassium ions on the DNA backbone (Pieles et al., (1993) Nucl. Acids Res. 21:3191-3196). A
6-nl aliquot of a matrix solution of 3-hydroxypicolinic acid [0.7 M 3-hydroxypicolinic acid-10 % ammonium citrate in 50
% acetonitrile; see Wu et al. Rapid Commun. Mass Spectrom. 7:142-146 (1993)] was added in series to each location
of the array using a robotic piezoelectric serial dispenser (i.e., a piezoelectric pipette system).
[0531] The piezoelectric pipette system is built on a system purchased from Microdrop GmbH, Norderstedt, Germany,
and contains a piezoelectric element driver which sends a pulsed signal to a piezoelectric element bonded to and
surrounding a glass capillary which holds the solution to be dispensed; a pressure transducer to load (by negative
pressure) or empty (by positive pressure) the capillary; a robotic xyz stage and robot driver to maneuver the capillary
for loading, unloading, dispensing, and cleaning; a stroboscope and driver pulsed at the frequency of the piezo element
to enable viewing of 'suspended' droplet characteristics; separate stages for source and destination plates or sample
targets (i.e., Si chip); a camera mounted to the robotic arm to view loading to the destination plate; and a data station
which controls the pressure unit, xyz robot, and piezoelectric driver.

[0532] The 3-HPA solution was allowed to dry at ambient temperature and thereafter a 6-nl aliquot of water was
added to each location using the piezoelectric pipette to resuspend the dried matrix-DNA complex, such that upon
drying at ambient temperature the matrix-DNA complex forms a uniform crystalline surface on the bottom surface of
each location.

MALDI-TOF MS analysis

[0533] The MALDI-TOF MS analysis was performed in series on each of the 16 locations of the hybridization array
illustrated in Figure 6 essentially as described in EXAMPLE 28. The resulting mass spectrum of oligonucleotides that
specifically hybridized to each of the 16 locations of the DNA hybridization array revealed a specific signal at each
location representative of the observed experimental mass-to-charge ratio corresponding to the specific complementary
nucleotide sequence.

[0534] For example, in the locations that have only Oligomer 1 conjugated thereto, the mass spectrum revealed a
predominant signal with an observed experimental mass-to-charge ratio of 7072.4, approximately equal to that of the
23-mer; the theoretical mass-to-charge ratio of the 23-mer is 7072.6 Da. Similarly, specific hybridization of the 12-mer
oligonucleotide to the array, observed experimental mass-to-charge ratio of 3618.33 Da (theoretical 3622.4 Da), was
detected only at those locations conjugated with Oligomer 2, whereas specific hybridization of MJM6 (observed
experimental mass-to-charge ratio of 6415.4) was detected only at those locations of the array conjugated with Oligomer 3
[theoretical 6407.2 Da].
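
The comparison of observed and theoretical values above can be sketched in a few lines of Python. This is only an illustration: the observed and theoretical values are those quoted in paragraph [0534], while the function names and the 0.2 % acceptance window are assumptions introduced here and are not part of the described procedure.

```python
# Observed vs. theoretical mass-to-charge values quoted in paragraph [0534].
SIGNALS = {
    "23-mer (Oligomer 1 locations)": (7072.4, 7072.6),
    "12-mer (Oligomer 2 locations)": (3618.33, 3622.4),
    "MJM6 (Oligomer 3 locations)": (6415.4, 6407.2),
}


def relative_deviation_percent(observed, theoretical):
    """Signed deviation of the observed value from theory, in percent."""
    return 100.0 * (observed - theoretical) / theoretical


def matches(observed, theoretical, tolerance_percent=0.2):
    """True if observed lies within the tolerance window around theory.

    tolerance_percent is a hypothetical threshold chosen for illustration.
    """
    return abs(relative_deviation_percent(observed, theoretical)) <= tolerance_percent


if __name__ == "__main__":
    for label, (obs, theo) in SIGNALS.items():
        dev = relative_deviation_percent(obs, theo)
        print(f"{label}: {dev:+.3f} % -> match={matches(obs, theo)}")
```

Under this assumed window each of the three signals is assigned to its expected oligonucleotide, and none of them would be mistaken for one of the other two masses.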

[0535] None of the locations of the array revealed a signal that corresponds to the negative control 29-mer
oligonucleotide (theoretical mass-to-charge ratio of 8974.8), indicating that specific target DNA molecules can be hybridized
to oligomers covalently immobilized to specific locations on the surface of the silicon array and a plurality of hybridization
assays may be individually monitored using MALDI-TOF MS analysis.

(b) 8 x 8 (64-location) array

[0536] A 2 x 2 cm² silicon wafer having 256 individual depressions or wells that form a 16 x 16 array of wells was
purchased from a commercial supplier [Accelerator Technology Corp., College Station, Texas]. The wells were 800 x
800 µm², 120 µm deep, on a 1.125 mm pitch. The silicon wafer was reacted with 3-aminopropyltriethoxysilane to produce
a uniform layer of primary amines on the surface and then exposed to the heterobifunctional crosslinker SIAB, resulting
in iodoacetamido functionalities on the surface as described above.

[0537] To make an array of 64 elements, a pintool was used following the procedures described above. The pintool
was dipped into 16 wells of a 384 well DNA source plate containing solutions of Oligomers 1-3, moved to the silicon
wafer, and the sample spotted by surface contact. Next, the tool was dipped in washing solution, then dipped into the
same 16 wells of the source plate, and spotted onto the target 2.25 mm offset from the initial set of 16 spots; the entire
cycle was repeated to make a 2x2 array from each pin to produce an 8x8 array of spots (2x2 elements/pin x 16 pins
= 64 total elements spotted).
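
The spotting pattern described above can be sketched as a coordinate calculation. The 16 pins, the 2.25 mm offset and the 2x2 prints per pin are taken from the text; the 4 x 4 pin arrangement with a 4.5 mm pin spacing (the usual well spacing of a 384-well plate) and the function name are assumptions made only for this illustration.

```python
from itertools import product

PIN_ROWS, PIN_COLS = 4, 4   # 16 pins (from the text); 4 x 4 layout is assumed
PIN_SPACING_MM = 4.5        # assumption: pins match 384-well plate spacing
OFFSET_MM = 2.25            # offset between successive prints (from the text)


def spot_positions():
    """Return the (x, y) positions in mm of all spots after four print cycles.

    Each pin prints at its home position and at the three positions offset
    by 2.25 mm in x, in y, and in both, giving a 2x2 block per pin.
    """
    offsets = list(product((0.0, OFFSET_MM), repeat=2))
    spots = set()
    for dx, dy in offsets:
        for row, col in product(range(PIN_ROWS), range(PIN_COLS)):
            spots.add((col * PIN_SPACING_MM + dx, row * PIN_SPACING_MM + dy))
    return sorted(spots)


if __name__ == "__main__":
    spots = spot_positions()
    xs = sorted({x for x, _ in spots})
    print(len(spots), "spots; x positions (mm):", xs)
```

With these assumed dimensions the four print cycles fill a regular 8 x 8 grid on a 2.25 mm spacing, so each spot would land on every second well of a wafer with a 1.125 mm well pitch.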

[0538] Oligomers 1-3 immobilized to the 64 locations were hybridized to complementary oligonucleotides and
analyzed by MALDI-TOF MS analysis. As observed for the 16-location array, specific hybridization of the complementary
oligonucleotide to each of the immobilized thiol-containing oligomers was observed in each of the locations of the DNA
array.

EXAMPLE 30 

Extension of hybridized DNA primers bound to DNA templates immobilized on a silicon wafer

[0539] The SIAB-derivatized silicon wafers can also be employed for primer extension reactions of the immobilized
DNA template using the procedures essentially described in EXAMPLE 7.

[0540] A 27-mer oligonucleotide [SEQ ID NO: 125] containing a 3'-free thiol group was coupled to a SIAB-derivatized
silicon wafer as described above, for example, in EXAMPLE 28. A 12-mer oligonucleotide primer [SEQ ID NO: 126]
was hybridized to the immobilized oligonucleotide and the primer was extended using a commercially available kit
[e.g., Sequenase or ThermoSequenase, U.S. Biochemical Corp]. The addition of Sequenase DNA polymerase or
ThermoSequenase DNA polymerase in the presence of three deoxyribonucleoside triphosphates (dNTPs; dATP, dGTP,
dCTP) and dideoxyribonucleoside thymidine triphosphate (ddTTP) in buffer according to the instructions provided by
the manufacturer resulted in a 3-base extension of the 12-mer primer while still bound to the silicon wafer. The wafer
was then analyzed by MALDI-TOF mass spectrometry as described above. The mass spectrum results clearly
distinguish the 15-mer [SEQ ID NO: 127] from the original unextended 12-mer, thus indicating that specific extension can
be performed on the surface of a silicon wafer and detected using MALDI-TOF MS analysis.

EXAMPLE 31

Effect of linker length on polymerase extension of hybridized DNA primers bound to DNA templates 
immobilized on a silicon wafer 


[0541] The effect of the distance between the SIAB-conjugated silicon surface and the duplex DNA formed by
hybridization of the target DNA to the immobilized oligomer template was investigated, as well as choice of enzyme.
[0542] Two SIAB-derivatized silicon wafers were conjugated to the 3'-end of two free thiol-containing oligonucleotides
of identical DNA sequence except for a 3-base poly dT spacer sequence incorporated at the 3'-end:

CTGATGCGTC GGATCATCTT TTTT SEQ ID No. 122

CTGATGCGTC GGATCATCTT TTTTTTT SEQ ID No. 125.

These oligonucleotides were synthesized and each was separately immobilized to the surface of a silicon wafer through
the SIAB cross-linker [e.g., see EXAMPLE 28]. Each wafer was incubated with a 12-mer oligonucleotide:

AAAAAAGATG AT SEQ ID No. 126

GATGATCCGA CG SEQ ID No. 128

GATCCGACGC AT SEQ ID No. 129,

which is complementary to portions of the nucleotide sequences common to both of the oligonucleotides, by denaturing
at 75°C and slow cooling the silicon wafer. The wafers were then analyzed by MALDI-TOF mass spectrometry as
described above.

[0543] As described in EXAMPLE 30 above, a 3-base specific extension of the bound 12-mer oligonucleotide was
observed using the oligomer primer where there is a 9-base spacer between the duplex and the surface [SEQ ID NO:
125]. Similar results were observed when the DNA spacer lengths between the SIAB moiety and the DNA duplex were
0, 3, 6 and 12. In addition, the extension reaction may be performed using a variety of DNA polymerases, such as
Sequenase and Thermo Sequenase (US Biochemical). Thus, the SIAB linker may be directly coupled to the DNA
template or may include a linker sequence without affecting primer extension of the hybridized DNA.
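
The spacer lengths discussed in this example can be reproduced from the sequences listed above. The following Python sketch is illustrative only: it assumes that each template is tethered to the surface through its 3'-end, so that the template bases lying 3' of the primer binding site form the spacer, and the function names are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

# Immobilized templates and 12-mer primers, written 5'->3' as listed in this example.
TEMPLATES = {
    122: "CTGATGCGTCGGATCATCTTTTTT",
    125: "CTGATGCGTCGGATCATCTTTTTTTTT",
}
PRIMERS = {
    126: "AAAAAAGATGAT",
    128: "GATGATCCGACG",
    129: "GATCCGACGCAT",
}


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def spacer_length(template, primer):
    """Number of template bases between the duplex and the tethered 3'-end."""
    site = reverse_complement(primer)
    start = template.find(site)
    if start < 0:
        raise ValueError("primer is not complementary to the template")
    return len(template) - (start + len(site))


if __name__ == "__main__":
    for t_id, template in TEMPLATES.items():
        for p_id, primer in PRIMERS.items():
            print(f"SEQ ID {t_id} + SEQ ID {p_id}: "
                  f"{spacer_length(template, primer)}-base spacer")
```

Across the six template and primer combinations the calculated spacers are 0, 3, 6, 9 and 12 bases, which agrees with the range of spacer lengths described in paragraph [0543].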

EXAMPLE 32

Spectrochip mutant detection in ApoE gene

[0544] This example describes the hybridization of an immobilized template, primer extension and mass spectrometry
for detection of the wildtype and mutant Apolipoprotein E gene for diagnostic purposes. This example demonstrates
that immobilized DNA molecules containing a specific sequence can be detected and distinguished using primer
extension of unlabeled allele specific primers and analysis of the extension products using mass spectrometry.
[0545] A 50 base synthetic DNA template complementary to the coding sequence of allele 3 of the wildtype
apolipoprotein E gene:

5'-GCCTGGTACACTGCCAGGCGCTTCTGCAGGTCATCGGCATCGCGGAGGAG-3'
[SEQ ID NO: 280]

or complementary to the mutant apolipoprotein E gene carrying a G->A transition at codon 158:

5'-GCCTGGTACACTGCCAGGCACTTCTGCAGGTCATCGGCATCGCGGAGGAG-3'
[SEQ ID NO: 281]

containing a 3'-free thiol group was coupled to separate SIAB-derivatized silicon wafers as described in Example 28.
[0546] A 21-mer oligonucleotide primer:

5'-GAT GCC GAT GAC CTG CAG AAG-3' [SEQ ID NO: 282] was hybridized to each of the immobilized templates and
the primer was extended using a commercially available kit [e.g., Sequenase or Thermosequenase, U.S. Biochemical
Corp]. The addition of Sequenase DNA polymerase or Thermosequenase DNA polymerase in the presence of three
deoxyribonucleoside triphosphates (dNTPs; dATP, dGTP, dTTP) and dideoxyribonucleoside cytosine triphosphate
(ddCTP) in buffer according to the instructions provided by the manufacturer resulted in a single base extension of the
21-mer primer bound to the immobilized template encoding the wildtype apolipoprotein E gene and a three base
extension of the 21-mer primer bound to the immobilized template encoding the mutant form of apolipoprotein E gene.
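
The different extension lengths on the two templates follow directly from the sequences given above, as the following Python sketch illustrates. It is a simplified model written for this explanation, not the kit protocol: the polymerase is assumed to add the complement of each template base 5' of the primer binding site until the dideoxynucleotide is incorporated, and all function names are hypothetical.

```python
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}

WILDTYPE = "GCCTGGTACACTGCCAGGCGCTTCTGCAGGTCATCGGCATCGCGGAGGAG"  # SEQ ID NO: 280
MUTANT = "GCCTGGTACACTGCCAGGCACTTCTGCAGGTCATCGGCATCGCGGAGGAG"    # SEQ ID NO: 281
PRIMER = "GATGCCGATGACCTGCAGAAG"                                  # SEQ ID NO: 282


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def extend_primer(template, primer, dntps, ddntp):
    """Return the list of nucleotides added to the 3'-end of the primer.

    template and primer are written 5'->3'. Extension stops when the
    chain-terminating ddNTP is incorporated, when a required dNTP is
    missing from the mix, or when the template ends.
    """
    site = reverse_complement(primer)
    start = template.find(site)
    if start < 0:
        raise ValueError("primer is not complementary to the template")
    added = []
    # The template is read 3'->5', i.e. leftwards from the binding site.
    for base in reversed(template[:start]):
        incoming = COMPLEMENT[base]
        if incoming == ddntp:
            added.append("dd" + incoming)
            break
        if incoming not in dntps:
            break
        added.append(incoming)
    return added


if __name__ == "__main__":
    mix = {"A", "G", "T"}  # dATP, dGTP, dTTP
    print("wildtype:", extend_primer(WILDTYPE, PRIMER, mix, "C"))
    print("mutant:  ", extend_primer(MUTANT, PRIMER, mix, "C"))
```

In this model the wildtype template terminates immediately with ddC (a 22-mer), whereas the mutant template directs incorporation of T and G before ddC (a 24-mer), consistent with the one-base and three-base extensions described above.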

[0547] The wafers were analyzed by mass spectrometry as described herein. The wildtype apolipoprotein E
sequence results in a mass spectrum that distinguishes the primer with a single base extension (22-mer) with a mass to
charge ratio of 6771.17 Da (the theoretical mass to charge ratio is 6753.5 Da) from the original 21-mer primer with a
mass to charge ratio of 6499.64 Da. The mutant apolipoprotein E sequence results in a mass spectrum that
distinguishes the primer with a three base extension (24-mer) with a mass to charge ratio of 7386.9 (the theoretical mass
to charge ratio is 7386.9) from the original 21-mer primer with a mass to charge ratio of 6499.64 Da.
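
The theoretical masses of the extension products quoted above can be estimated from base composition. The sketch below is an approximation under stated assumptions: it uses commonly tabulated average residue masses for a DNA strand with free 5'- and 3'-hydroxyl groups, treats the terminal dideoxynucleotide as lacking one oxygen atom, and ignores charge state and counter-ions; the constant and function names are hypothetical.

```python
# Average masses (Da) of nucleotide residues within a DNA strand (assumed values).
RESIDUE_MASS = {"A": 313.21, "C": 289.18, "G": 329.21, "T": 304.20}
TERMINAL_CORRECTION = -61.96  # strand with free 5'-OH and 3'-OH ends
DIDEOXY_CORRECTION = -16.00   # a 3'-terminal dideoxynucleotide lacks one oxygen

PRIMER = "GATGCCGATGACCTGCAGAAG"  # SEQ ID NO: 282 (21-mer)


def oligo_mass(seq, dideoxy_terminus=False):
    """Approximate average mass (Da) of a single-stranded DNA oligonucleotide."""
    mass = sum(RESIDUE_MASS[base] for base in seq) + TERMINAL_CORRECTION
    if dideoxy_terminus:
        mass += DIDEOXY_CORRECTION
    return mass


if __name__ == "__main__":
    # Wildtype template: primer + ddC; mutant template: primer + T + G + ddC.
    print(f"22-mer: {oligo_mass(PRIMER + 'C', dideoxy_terminus=True):.1f} Da")
    print(f"24-mer: {oligo_mass(PRIMER + 'TGC', dideoxy_terminus=True):.1f} Da")
```

Under these assumptions the calculated values fall within about 0.1 Da of the theoretical 6753.5 Da and 7386.9 Da stated for the 22-mer and 24-mer.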

EXAMPLE 33 

Detection of Double-Stranded Nucleic Acid Molecules via Strand Displacement and Hybridization to an
Immobilized Complementary Nucleic Acid

[0548] This example describes immobilization of a 24-mer primer and the specific hybridization of one strand of a
duplex DNA molecule, thereby permitting amplification of a selected target molecule in solution phase and permitting
detection of the double stranded molecule. This method is useful for detecting single base changes, and, particularly
for screening genomic libraries of double-stranded fragments.

[0549] A 24-mer DNA primer CTGATGCGTC GGATCATCTT TTTT SEQ ID No. 122, containing a 3'-free thiol group,
was coupled to a SIAB-derivatized silicon wafer as described in Example 29.
[0550] An 18-mer synthetic oligonucleotide:

5'-CTGATGCGTCGGATCATC-3' [SEQ ID NO: 286] was premixed with a 12-mer 5'-GATGATCCGACG-3' [SEQ ID NO:
285] that has a sequence that is complementary to a 12 base portion of the 18-mer oligonucleotide. The oligonucleotide
mix was heated to 75°C and cooled slowly to room temperature to facilitate the formation of a duplex molecule:

5'-CTGATGCGTCGGATCATC-3' [SEQ ID NO: 286]
3'-      GCAGCCTAGTAG-5' [SEQ ID NO: 287].

[0551] The specific hybridization of the 12-mer strand of the duplex molecule to the immobilized 24-mer primer was
carried out by mixing 1 µM of the duplex molecule using the hybridization conditions described in Example 30.
[0552] The wafers were analyzed by mass spectrometry as described above. Specific hybridization was detected in
a mass spectrum of the 12-mer with a mass to charge ratio of 3682.78 Da.

EXAMPLE 34 

1-(2-Nitro-5-(3-O-4,4'-dimethoxytritylpropoxy)phenyl)-1-O-((2-cyanoethoxy)-diisopropylaminophosphino)ethane

A. 2-Nitro-5-(3-hydroxypropoxy)benzaldehyde

[0553] 3-Bromo-1-propanol (3.34 g, 24 mmol) was refluxed in 80 ml of anhydrous acetonitrile with
5-hydroxy-2-nitrobenzaldehyde (3.34 g, 20 mmol), K2CO3 (3.5 g), and KI (100 mg) overnight (15 h). The reaction mixture was cooled
to room temperature and 150 ml of methylene chloride was added. The mixture was filtered and the solid residue was
washed with methylene chloride. The combined organic solution was evaporated to dryness and redissolved in 100
ml methylene chloride. The resulting solution was washed with saturated NaCl solution and dried over sodium sulfate.
4.31 g (96%) of desired product was obtained after removal of the solvent in vacuo.
Rf = 0.33 (dichloromethane/methanol, 95/5).

UV (methanol) maximum: 313, 240 (shoulder), 215 nm; minimum: 266 nm.

1H NMR (DMSO-d6) δ 10.28 (s, 1H), 8.17 (d, 1H), 7.35 (d, 1H), 7.22 (s, 1H), 4.22 (t, 2H), 3.54 (t, 2H), 1.90 (m, 2H).
13C NMR (DMSO-d6) δ 189.9, 153.0, 141.6, 134.3, 127.3, 118.4, 114.0, 66.2, 56.9, 31.7.
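
The yield quoted for this step can be checked with a short calculation. The Python sketch below is an illustration only: the molecular formula C10H11NO5 is inferred here from the name of the product rather than stated in the text, the atomic masses are rounded standard values, and the function names are hypothetical.

```python
# Rounded standard atomic masses (g/mol).
ATOMIC_MASS = {"C": 12.011, "H": 1.008, "N": 14.007, "O": 15.999}

# 2-Nitro-5-(3-hydroxypropoxy)benzaldehyde; formula inferred from the name.
PRODUCT_FORMULA = {"C": 10, "H": 11, "N": 1, "O": 5}


def molar_mass(formula):
    """Molar mass (g/mol) from a dict mapping element symbol to atom count."""
    return sum(ATOMIC_MASS[element] * count for element, count in formula.items())


def percent_yield(product_g, product_molar_mass, limiting_mmol):
    """Isolated yield in percent relative to the limiting reagent."""
    product_mmol = 1000.0 * product_g / product_molar_mass
    return 100.0 * product_mmol / limiting_mmol


if __name__ == "__main__":
    mw = molar_mass(PRODUCT_FORMULA)
    # 4.31 g isolated; 20 mmol of 5-hydroxy-2-nitrobenzaldehyde is limiting.
    print(f"M = {mw:.2f} g/mol, yield = {percent_yield(4.31, mw, 20.0):.1f} %")
```

With these inputs the calculated yield rounds to the 96% reported above; the same two functions can be applied to the later steps by substituting the corresponding formula, isolated mass and limiting amount.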
B. 2-Nitro-5-(3-O-t-butyldimethylsilylpropoxy)benzaldehyde

[0554] 2-Nitro-5-(3-hydroxypropoxy)benzaldehyde (1 g, 4.44 mmol) was dissolved in 50 ml anhydrous acetonitrile.
To this solution were added 1 ml of triethylamine, 200 mg of imidazole, and 0.8 g (5.3 mmol) of tBDMSCl. The mixture
was stirred at room temperature for 4 h. Methanol (1 ml) was added to stop the reaction. The solvent was removed in
vacuo and the solid residue was redissolved in 100 ml methylene chloride. The resulting solution was washed with
saturated sodium bicarbonate solution and then water. The organic phase was dried over sodium sulfate and the
solvent was removed in vacuo. The crude mixture was subjected to a quick silica gel column with methylene chloride
to yield 1.44 g (96%) of 2-nitro-5-(3-O-t-butyldimethylsilylpropoxy)benzaldehyde.

Rf = 0.67 (hexane/ethyl acetate, 5/1).

UV (methanol), maximum: 317, 243, 215 nm; minimum: 235, 267 nm.
1H NMR (DMSO-d6) δ 10.28 (s, 1H), 8.14 (d, 1H), 7.32 (d, 1H), 7.20 (s, 1H), 4.20 (t, 2H), 3.75 (t, 2H), 1.90 (m, 2H), 0.85 (s, 9H), 0.02 (s, 6H).
13C NMR (DMSO-d6) δ 189.6, 162.7, 141.5, 134.0, 127.1, 118.2, 113.8, 65.4, 58.5, 31.2, 25.5, -3.1, -5.7.

C. 1-(2-Nitro-5-(3-O-t-butyldimethylsilylpropoxy)phenyl)ethanol

[0555] High vacuum dried 2-nitro-5-(3-O-t-butyldimethylsilylpropoxy)benzaldehyde (1.02 g, 3 mmol) was dissolved
in 50 ml of anhydrous methylene chloride. 2 M Trimethylaluminium in toluene (3 ml) was added dropwise within 10 min
while the reaction mixture was kept at room temperature. It was stirred for a further 10 min and the mixture was poured
into 10 ml ice cooled water. The emulsion was separated from the water phase and dried over 100 g of sodium sulfate to
remove the remaining water. The solvent was removed in vacuo and the mixture was applied to a silica gel column
with gradient methanol in methylene chloride. 0.94 g (86%) of desired product was isolated.
Rf = 0.375 (hexane/ethyl acetate, 5/1).

UV (methanol), maximum: 306, 233, 206 nm; minimum: 255, 220 nm.
1H NMR (DMSO-d6) δ 8.00 (d, 1H), 7.36 (s, 1H), 7.00 (d, 1H), 5.49 (b, OH), 5.31 (q, 1H), 4.19 (m, 2H), 3.77 (t, 2H), 1.95 (m, 2H), 1.37 (d, 3H), 0.86 (s, 9H), 0.04 (s, 6H).
13C NMR (DMSO-d6) δ 162.6, 146.2, 139.6, 126.9, 112.9, 112.5, 64.8, 63.9, 58.7, 31.5, 25.6, 24.9, -3.4, -5.8.

D. 1-(2-Nitro-5-(3-hydroxypropoxy)phenyl)ethanol 

[0556] 1-(2-Nitro-5-(3-O-t-butyldimethylsilylpropoxy)phenyl)ethanol (0.89 g, 2.5 mmol) was dissolved in 30 ml of THF
and 0.5 mmol of nBu4NF was added under stirring. The mixture was stirred at room temperature for 5 h and the solvent
was removed in vacuo. The remaining residue was applied to a silica gel column with gradient methanol in methylene
chloride. 1-(2-Nitro-5-(3-hydroxypropoxy)phenyl)ethanol (0.6 g, 99%) was obtained.
Rf = 0.17 (dichloromethane/methanol, 95/5).

UV (methanol), maximum: 304, 232, 210 nm; minimum: 255, 219 nm.
1H NMR (DMSO-d6) δ 8.00 (d, 1H), 7.33 (s, 1H), 7.00 (d, 1H), 5.50 (d, OH), 5.28 (t, OH), 4.59 (t, 1H), 4.17 (t, 2H), 3.57 (m, 2H), 1.89 (m, 2H), 1.36 (d, 2H).
13C NMR (DMSO-d6) δ 162.8, 146.3, 139.7, 127.1, 113.1, 112.6, 65.5, 64.0, 57.0, 31.8, 25.0.

E. 1-(2-Nitro-5-(3-O-4,4'-dimethoxytritylpropoxy)phenyl)ethanol

[0557] 1-(2-Nitro-5-(3-hydroxypropoxy)phenyl)ethanol (0.482 g, 2 mmol) was co-evaporated with anhydrous pyridine
twice and dissolved in 20 ml anhydrous pyridine. The solution was cooled in an ice-water bath and 750 mg (2.2 mmol) of
DMTCl was added. The reaction mixture was stirred at room temperature overnight and 0.5 ml methanol was added
to stop the reaction. The solvent was removed in vacuo and the residue was co-evaporated with toluene twice to
remove traces of pyridine. The final residue was applied to a silica gel column with gradient methanol in methylene
chloride containing drops of triethylamine to yield 0.96 g (89%) of the desired product
1-(2-nitro-5-(3-O-4,4'-dimethoxytritylpropoxy)phenyl)ethanol.
Rf = 0.50 (dichloromethane/methanol, 99/1).

UV (methanol), maximum: 350 (shoulder), 305, 283, 276 (shoulder), 233, 208 nm; minimum: 290, 258, 220 nm.
1H NMR (DMSO-d6) δ 8.00 (d, 1H), 6.82-7.42 (ArH), 5.52 (d, OH), 5.32 (m, 1H), 4.23 (t, 2H), 3.71 (s, 6H), 3.17 (t, 2H),
2.00 (m, 2H), 1.37 (d, 3H).

13C NMR (DMSO-d6) δ 162.5, 157.9, 157.7, 146.1, 144.9, 140.1, 139.7, 135.7, 129.5, 128.8, 127.6, 127.5, 127.3,
126.9, 126.4, 113.0, 112.8, 112.6, 85.2, 65.3, 63.9, 59.0, 54.8, 28.9, 24.9.

F. 1-(2-Nitro-5-(3-O-4,4'-dimethoxytritylpropoxy)phenyl)-1-O-((2-cyanoethoxy)-diisopropylaminophosphino)ethane

[0558] 1-(2-Nitro-5-(3-O-4,4'-dimethoxytritylpropoxy)phenyl)ethanol (400 mg, 0.74 mmol) was dried under high
vacuum and was dissolved in 20 ml of anhydrous methylene chloride. To this solution were added 0.5 ml
N,N-diisopropylethylamine and 0.3 ml (1.34 mmol) of 2-cyanoethyl-N,N-diisopropylchlorophosphoramidite. The reaction mixture
was stirred at room temperature for 30 min and 0.5 ml of methanol was added to stop the reaction. The mixture was
washed with saturated sodium bicarbonate solution and was dried over sodium sulfate. The solvent was removed in
vacuo and a quick silica gel column with 1 % methanol in methylene chloride containing drops of triethylamine yielded
510 mg (93%) of the desired phosphoramidite.
Rf = 0.87 (dichloromethane/methanol, 99/1).

EXAMPLE 35 

1-(4-(3-O-4,4'-Dimethoxytritylpropoxy)-3-methoxy-6-nitrophenyl)-1-O-((2-cyanoethoxy)-diisopropylaminophosphino)ethane

A. 4-(3-Hydroxypropoxy)-3-methoxyacetophenone 


[0559] 3-Bromo-1-propanol (3 ml, 33 mmol) was refluxed in 100 ml of anhydrous acetonitrile with
4-hydroxy-3-methoxyacetophenone (5 g, 30 mmol), K2CO3 (5 g), and KI (300 mg) overnight (15 h).

Methylene chloride (150 ml) was added to the reaction mixture after cooling to room temperature. The mixture was
filtered and the solid residue was washed with methylene chloride. The combined organic solution was evaporated to
dryness and redissolved in 100 ml methylene chloride. The resulting solution was washed with saturated NaCl solution
and dried over sodium sulfate. 6.5 g (96.4%) of desired product was obtained after removal of the solvent in vacuo.
Rf = 0.41 (dichloromethane/methanol, 95/5).

UV (methanol), maximum: 304, 273, 227, 210 nm; minimum: 291, 244, 214 nm.

1H NMR (DMSO-d6) δ 7.64 (d, 1H), 7.46 (s, 1H), 7.04 (d, 1H), 4.58 (b, OH), 4.12 (t, 2H), 3.80 (s, 3H), 3.56 (t, 2H), 2.54
(s, 3H), 1.88 (m, 2H).
13C NMR (DMSO-d6) δ 196.3, 152.5, 148.6, 129.7, 123.1, 111.5, 110.3, 65.4, 57.2, 55.5, 31.9, 26.3.

B. 4-(3-Acetoxypropoxy)-3-methoxyacetophenone 

[0560] 4-(3-Hydroxypropoxy)-3-methoxyacetophenone (3.5 g, 15.6 mmol) was dried and dissolved in 80 ml
anhydrous acetonitrile. To this mixture, 6 ml of triethylamine and 6 ml of acetic anhydride were added. After 4 h, 6 ml methanol
was added and the solvent was removed in vacuo. The residue was dissolved in 100 ml dichloromethane and the
solution was washed with dilute sodium bicarbonate solution, then water. The organic phase was dried over sodium
sulfate and the solvent was removed. The solid residue was applied to a silica gel column with methylene chloride to
yield 4.1 g of 4-(3-acetoxypropoxy)-3-methoxyacetophenone (98.6%).
Rf = 0.22 (dichloromethane/methanol, 99/1).

UV (methanol), maximum: 303, 273, 227, 210 nm; minimum: 290, 243, 214 nm.

1H NMR (DMSO-d6) δ 7.62 (d, 1H), 7.45 (s, 1H), 7.08 (d, 1H), 4.12 (m, 4H), 3.82 (s, 3H), 2.54 (s, 3H), 2.04 (m, 2H),
2.00 (s, 3H).

13C NMR (DMSO-d6) δ 196.3, 170.4, 152.2, 148.6, 130.0, 123.0, 111.8, 110.4, 65.2, 60.8, 55.5, 27.9, 26.3, 20.7.

C. 4-(3-Acetoxypropoxy)-3-methoxy-6-nitroacetophenone

[0561] 4-(3-Acetoxypropoxy)-3-methoxyacetophenone (3.99 g, 15 mmol) was added portionwise to 15 ml of 70%
HNO3 in a water bath, keeping the reaction temperature at room temperature. The reaction mixture was stirred at
room temperature for 30 min and 30 g of crushed ice was added. This mixture was extracted with 100 ml of
dichloromethane and the organic phase was washed with saturated sodium bicarbonate solution. The solution was dried
over sodium sulfate and the solvent was removed in vacuo. The crude mixture was applied to a silica gel column with
gradient methanol in methylene chloride to yield 3.8 g (81.5%) of desired product
4-(3-acetoxypropoxy)-3-methoxy-6-nitroacetophenone and 0.38 g (8%) of ipso-substituted product 5-(3-acetoxypropoxy)-4-methoxy-1,2-dinitrobenzene.
Side ipso-substituted product 5-(3-acetoxypropoxy)-4-methoxy-1,2-dinitrobenzene:
Rf = 0.47 (dichloromethane/methanol, 99/1).

UV (methanol), maximum: 334, 330, 270, 240, 212 nm; minimum: 310, 282, 263, 223 nm.
1H NMR (CDCl3) δ 7.36 (s, 1H), 7.34 (s, 1H), 4.28 (t, 2H), 4.18 (t, 2H), 4.02 (s, 3H), 2.20 (m, 2H), 2.08 (s, 3H).
13C NMR (CDCl3) δ 170.9, 152.2, 151.1, 117.6, 111.2, 107.9, 107.1, 66.7, 60.6, 56.9, 28.2, 20.9.
Desired product 4-(3-acetoxypropoxy)-3-methoxy-6-nitroacetophenone:
Rf = 0.29 (dichloromethane/methanol, 99/1).

UV (methanol), maximum: 344, 300, 246, 213 nm; minimum: 320, 270, 227 nm.
1H NMR (CDCl3) δ 7.62 (s, 1H), 6.74 (s, 1H), 4.28 (t, 2H), 4.20 (t, 2H), 3.96 (s, 3H), 2.48 (s, 3H), 2.20 (m, 2H), 2.08 (s, 3H).
13C NMR (CDCl3) δ 200.0, 171.0, 154.3, 148.8, 138.3, 133.0, 108.8, 108.0, 66.1, 60.8, 56.6, 30.4, 28.2, 20.9.

D. 1-(4-(3-Hydroxypropoxy)-3-methoxy-6-nitrophenyl)ethanol 


[0562] To 4-(3-acetoxypropoxy)-3-methoxy-6-nitroacetophenone (3.73 g, 12 mmol) were added 150 ml ethanol and 6.5
g of K2CO3. The mixture was stirred at room temperature for 4 h and TLC with 5% methanol in dichloromethane indicated
the completion of the reaction. To this same reaction mixture was added 3.5 g of NaBH4 and the mixture was stirred
at room temperature for 2 h. Acetone (10 ml) was added to react with the remaining NaBH4. The solvent was removed
in vacuo and the residue was taken up into 50 g of silica gel. The silica gel mixture was applied on the top of a silica
gel column with 5% methanol in methylene chloride to yield 3.15 g (97%) of desired product
1-(4-(3-hydroxypropoxy)-3-methoxy-6-nitrophenyl)ethanol.

Intermediate product 4-(3-hydroxypropoxy)-3-methoxy-6-nitroacetophenone after deprotection:
Rf = 0.60 (dichloromethane/methanol, 95/5).
Final product 1-(4-(3-hydroxypropoxy)-3-methoxy-6-nitrophenyl)ethanol:
Rf = 0.50 (dichloromethane/methanol, 95/5).

UV (methanol), maximum: 344, 300, 243, 219 nm; minimum: 317, 264, 233 nm.

1H NMR (DMSO-d6) δ 7.54 (s, 1H), 7.36 (s, 1H), 5.47 (d, OH), 5.27 (m, 1H), 4.55 (t, OH), 4.05 (t, 2H), 3.90 (s, 3H),
3.55 (q, 2H), 1.88 (m, 2H), 1.37 (d, 3H).
13C NMR (DMSO-d6) δ 153.4, 146.4, 138.8, 137.9, 109.0, 108.1, 68.5, 65.9, 57.2, 56.0, 31.9, 29.6.

E. 1-(4-(3-O-4,4'-Dimethoxytritylpropoxy)-3-methoxy-6-nitrophenyl)ethanol

[0563] 1-(4-(3-Hydroxypropoxy)-3-methoxy-6-nitrophenyl)ethanol (0.325 g, 1.2 mmol) was co-evaporated with
anhydrous pyridine twice and dissolved in 15 ml anhydrous pyridine. The solution was cooled in an ice-water bath and 450
mg (1.33 mmol) of DMTCl was added. The reaction mixture was stirred at room temperature overnight and 0.5 ml
methanol was added to stop the reaction. The solvent was removed in vacuo and the residue was co-evaporated with
toluene twice to remove traces of pyridine. The final residue was applied to a silica gel column with gradient methanol
in methylene chloride containing drops of triethylamine to yield 605 mg (88%) of desired product
1-(4-(3-O-4,4'-dimethoxytritylpropoxy)-3-methoxy-6-nitrophenyl)ethanol. Rf = 0.50 (dichloromethane/methanol, 95/5).
UV (methanol), maximum: 354, 302, 282, 274, 233, 209 nm; minimum: 322, 292, 263, 222 nm.
1H NMR (DMSO-d6) δ 7.54 (s, 1H), 6.8-7.4 (ArH), 5.48 (d, OH), 5.27 (m, 1H), 4.16 (t, 2H), 3.85 (s, 3H), 3.72 (s, 6H),
3.15 (t, 2H), 1.98 (t, 2H), 1.37 (d, 3H).

13C NMR (DMSO-d6) δ 157.8, 153.3, 146.1, 144.9, 138.7, 137.8, 135.7, 129.4, 128.7, 127.5, 127.4, 126.3, 112.9, 112.6,
108.9, 108.2, 85.1, 65.7, 63.7, 59.2, 55.8, 54.8, 29.0, 25.0.

F. 1-(4-(3-O-4,4'-Dimethoxytritylpropoxy)-3-methoxy-6-nitrophenyl)-1-O-((2-cyanoethoxy)-diisopropylaminophosphino)ethane

[0564] 1-(4-(3-O-4,4'-Dimethoxytritylpropoxy)-3-methoxy-6-nitrophenyl)ethanol (200 mg, 0.35 mmol) was dried under
high vacuum and was dissolved in 15 ml of anhydrous methylene chloride. To this solution were added 0.5 ml
N,N-diisopropylethylamine and 0.2 ml (0.89 mmol) of 2-cyanoethyl-N,N-diisopropylchlorophosphoramidite. The reaction
mixture was stirred at room temperature for 30 min and 0.5 ml of methanol was added to stop the reaction. The mixture
was washed with saturated sodium bicarbonate solution and was dried over sodium sulfate. The solvent was removed
in vacuo and a quick silica gel column with 1 % methanol in methylene chloride containing drops of triethylamine yielded
247 mg (91.3%) of the desired phosphoramidite
1-(4-(3-O-4,4'-dimethoxytritylpropoxy)-3-methoxy-6-nitrophenyl)-1-O-((2-cyanoethoxy)-diisopropylaminophosphino)ethane.
Rf = 0.87 (dichloromethane/methanol, 99/1).

EXAMPLE 36

Oligonucleotide synthesis 

[0565] The oligonucleotide conjugates containing photocleavable linker were prepared by solid phase nucleic acid
synthesis (see: Sinha et al. Tetrahedron Lett. 1983, 24, 5843-5846; Sinha et al. Nucleic Acids Res. 1984, 12, 4539-4557;
Beaucage et al. Tetrahedron 1993, 49, 6123-6194; and Matteucci et al. J. Am. Chem. Soc. 1981, 103, 3185-3191)
under standard conditions. In addition, a longer coupling time period was employed for the incorporation of the
photocleavable unit and the 5' terminal amino group. The coupling efficiency was detected by measuring the absorbance of
released DMT cation and the results indicated a comparable coupling efficiency of phosphoramidite
1-(2-nitro-5-(3-O-4,4'-dimethoxytritylpropoxy)phenyl)-1-O-((2-cyanoethoxy)-diisopropylaminophosphino)ethane or
1-(4-(3-O-4,4'-dimethoxytritylpropoxy)-3-methoxy-6-nitrophenyl)-1-O-((2-cyanoethoxy)-diisopropylaminophosphino)ethane with
those of common nucleoside phosphoramidites. Deprotection of the base protection and release of the conjugates
from the solid support was carried out with concentrated ammonium at 55°C overnight. Deprotection of the base
protection of other conjugates was done by fast deprotection with AMA reagents. Purification of the MMT-on conjugates
was done by HPLC (trityl-on) using 0.1 M triethylammonium acetate, pH 7.0 and a gradient of acetonitrile (5% to 25%
in 20 minutes). The collected MMT or DMT protected conjugate was reduced in volume, detritylated with 80% aqueous
acetic acid (40 min, 0°C), desalted, and stored at -20°C.

EXAMPLE 37 

Photolysis study 

[0566] In a typical case, 2 nmol of oligonucleotide conjugate containing photocleavable linker in 200 µl distilled water
was irradiated with a long wavelength UV lamp (Blak Ray XX-15 UV lamp, Ultraviolet products, San Gabriel, CA) at a
distance of 10 cm (emission peak 365 nm, lamp intensity = 1.1 mW/cm² at a distance of 31 cm). The resulting mixture
was analyzed by HPLC (trityl-off) using 0.1 M triethylammonium acetate, pH 7.0 and a gradient of acetonitrile. Analysis
showed that the conjugate was cleaved from the linker within minutes upon UV irradiation.

Equivalents 

[0567] Those skilled in the art will recognize, or be able to ascertain using no more than routine experimentation, 
numerous equivalents to the specific procedures described herein. Such equivalents are considered to be within the 
scope of this invention and are covered by the following claims. 




EP 1 164 203 A2 


SEQUENCE LISTING 


(1) GENERAL INFORMATION

(i) APPLICANT:

(A) NAME: SEQUENOM, INC.

(B) STREET: 11555 Sorrento Valley Road

(C) CITY: San Diego 

(D) STATE: California 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP): 92121 

(i) INVENTOR/APPLICANT: 

(A) NAME: Hubert Koster

(B) STREET: 836 Via Mallorca Drive

(C) CITY: La Jolla

(D) STATE: California

(D) COUNTRY: USA

(E) POSTAL CODE (ZIP): 92037

(i) INVENTOR/APPLICANT:
(A) NAME: Kai Tang

(B) STREET: 8521 Summerdale Rd #241 

(C) CITY: San Diego

(D) STATE: California

(D) COUNTRY: USA

(E) POSTAL CODE (ZIP): 92126

(i) INVENTOR/APPLICANT:

(A) NAME: Tong-Jing Fu 

(B) STREET: 10615 Dabney Dr, #21 

(C) CITY: San Diego 
(D) STATE: California

(D) COUNTRY: USA

(E) POSTAL CODE (ZIP): 92126

(i) INVENTOR/APPLICANT: 

(A) NAME: Carsten W. Siegert 

(B) STREET: Geilstr. 42 
(C) CITY: 22303 Hamburg

(D) STATE:

(D) COUNTRY: Germany

(E) POSTAL CODE (ZIP):

(i) INVENTOR/APPLICANT: 

(A) NAME: Daniel P. Little

(B) STREET: 393 Glendale Lake Rd.

(C) CITY: Patton

(D) STATE: Pennsylvania
(D) COUNTRY: USA

(E) POSTAL CODE (ZIP): 18668

(i) INVENTOR/APPLICANT:

(A) NAME: G. Scott Higgins 

(B) STREET: Haselweg 1 

(C) CITY: 22880 Weidel 

(D) STATE: 

(D) COUNTRY: Germany

(E) POSTAL CODE (ZIP):

(i) INVENTOR/APPLICANT:

(A) NAME: Andreas Braun 

(B) STREET: 13232 Benchley Road 

(C) CITY: San Diego 

(D) STATE: California 

(D) COUNTRY: USA 

(E) POSTAL CODE (ZIP) : 92130 

(i) INVENTOR/APPLICANT:

(A) NAME: Brigitte Darnhofer-Demar

(B) STREET: 3899 Haines St. #8-306 

(C) CITY: San Diego 

(D) STATE: California 

(D) COUNTRY: USA 

(E) POSTAL CODE (ZIP) : 92109 

(i) INVENTOR/APPLICANT:

(A) NAME: Christian Jurinke 

(B) STREET: Grope Hall 68 

(C) CITY: 22115 Hamburg 

(D) STATE: 

(D) COUNTRY: Germany 

(E) POSTAL CODE (ZIP) : 

(i) INVENTOR/APPLICANT:

(A) NAME: Dirk Van den Boom

(B) STREET: Forsthausstr. 8

(C) CITY: 63303 Dreieich

(D) STATE: 

(D) COUNTRY: Germany 

(E) POSTAL CODE (ZIP) : 

(i) INVENTOR/APPLICANT: 

(A) NAME: Guobing Xiang

(B) STREET: 11381 Zapata Ave.

(C) CITY: San Diego 

(D) STATE: California 

(D) COUNTRY: USA 

(E) POSTAL CODE (ZIP) : 92126 

(ii) TITLE OF THE INVENTION: DNA DIAGNOSTICS BASED ON MASS SPECTROMETRY

(iii) NUMBER OF SEQUENCES: 320 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Brown, Martin, Haller & McClain 

(B) STREET: 1660 Union Street 

(C) CITY: San Diego 

(D) STATE: CA

(E) COUNTRY: USA 

(F) ZIP: 92101-2926 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible
(C) OPERATING SYSTEM: DOS

(D) SOFTWARE: FastSEQ Version 1.5

(vi) CURRENT APPLICATION DATA: 
(A) APPLICATION NUMBER:
(B) FILING DATE: 06-NOV-1997
(C) CLASSIFICATION:

(vii) PRIOR APPLICATION DATA:
(A) APPLICATION NUMBER:
(B) FILING DATE: 10/08/97

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/933,792 

(B) FILING DATE: 09/19/97

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/787,639 

(B) FILING DATE: 01/23/97 

(vii) PRIOR APPLICATION DATA:

(A) APPLICATION NUMBER: 08/786,988

(B) FILING DATE: 01/23/97

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/746,055

(B) FILING DATE: 11/06/96 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/746,036 

(B) FILING DATE: 11/06/96 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/744,590 

(B) FILING DATE: 11/06/96

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/744,481 

(B) FILING DATE: 11/06/96 

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: Seidman, Stephanie L.

(B) REGISTRATION NUMBER: 33,779

(C) REFERENCE/DOCKET NUMBER: 7352-2004PC

(ix) TELECOMMUNICATION INFORMATION:

(A) TELEPHONE: 619-238-0999 

(B) TELEFAX: 619-238-0062 

(C) TELEX: 

(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:
GCAAGTGAAT CCTGAGCGTG

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO
(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:
GTGTGAAGGG TTCATATGC 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 28 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
ATCTATATTC ATCATAGGAA ACACCACA 
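
Each record in this listing declares a LENGTH and then prints the bases in blocks of ten, so the two can be cross-checked mechanically. The following is a minimal sketch of such a check, not part of the filed listing; the helper name `base_count` and the `records` dictionary are illustrative assumptions, and the two sequences are copied from SEQ ID NO: 2 and SEQ ID NO: 3 as printed above.

```python
# Hypothetical helper (not from the listing): count nucleotide letters in a
# printed sequence line, ignoring block spacing and trailing position numbers.
def base_count(sequence_line: str) -> int:
    return sum(1 for ch in sequence_line.upper() if ch in "ACGTUN")


# Declared LENGTH and printed sequence for two records of this listing.
records = {
    2: (19, "GTGTGAAGGG TTCATATGC"),
    3: (28, "ATCTATATTC ATCATAGGAA ACACCACA"),
}

for seq_id, (declared, printed) in records.items():
    counted = base_count(printed)
    status = "ok" if counted == declared else "MISMATCH"
    print(f"SEQ ID NO: {seq_id}: declared {declared}, counted {counted} ({status})")
```

A mismatch reported by a check of this kind only flags a record for manual comparison against the filed listing; it does not indicate which of the two values is in error.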

(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iv) ANTISENSE: NO
(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
GTATCTATAT TCATCATAGG AAACACCATT 

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:
GCTTTGGGGC ATGGACATTG ACCCGTATAA 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
CTGACTACTA ATTCCCTGGA TGCTGGGTCT 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
TTGCCTGAGT GCAGTATGGT 

(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:
AGCTCTATAT CGGGAAGCCT 20

(2) INFORMATION FOR SEQ ID NO:9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:
TTGTGCCACG CGGTTGGGAA TGTA 24 

(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 26 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:
AGCAACGACT GTTTGCCCGC CAGTTG 26 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:
TACATTCCCA ACCGCGTGGC ACAAC 25

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 25 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
AACTGGCGGG CAAACAGTCG TTGCT 25 
(2) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:
GCAAGTGAAT CCTGAGCGTG 20
(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

GTGTGAAGGG CGTG 14

(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 24 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
CTATATTCAT CATAGGAAAC ACCA 24 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GTCACCCTCG ACCTCCAG 18 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO
(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:
TTGTAAAACG ACGGCCAGT 19
(2) INFORMATION FOR SEQ ID NO: 18:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO
(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18

CTTCCACCGC GATGTTGA 

(2) INFORMATION FOR SEQ ID NO: 19:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 17 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19 
CAGGAAACAG CTATGAC 

(2) INFORMATION FOR SEQ ID NO: 20:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 17 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20 
GTAAAACGAC GGCCAGT

(2) INFORMATION FOR SEQ ID NO: 21:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 19 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21
GTCACCCTCG ACCTGCAGC

(2) INFORMATION FOR SEQ ID NO: 22:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO
(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 
GTTGTAAAAC GAGGGCCAGT 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 
TCTGGCCTGG TGCAGGGCCT ATTGTAGTTG TGACGTACA

(2) INFORMATION FOR SEQ ID NO: 24:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 14 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 
TGTACGTCAC AACT 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO
(iv) ANTISENSE: NO

(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:

AAGATCTGAC CAGGGATTCG GTTAGCGTGA CTGCTGCTGC TGCTGCTGCT GCTGGATGAT 60
CCGACGCATC AGATCTGG

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:
CTGATGCGTC GGATCATC 18
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 
GATGATCCGA CGCATCACAG CTC 23 
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28
TCGGTTCCAA GAGCTGTGAT GCGTCGGATC ATC

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29

GATGATCCGA CGCATCACAG CTC

(2) INFORMATION FOR SEQ ID NO: 30:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO
(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30 
GTGATGCGTC GGATCATC 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO
(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31
TCGGTTCCAA GAGCT

(2) INFORMATION FOR SEQ ID NO: 31:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 15 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
TCGGTTCCAA GAGCT 15 
(2) INFORMATION FOR SEQ ID NO: 32:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

CATTTGCTTC TGACACAACT G 21 

(2) INFORMATION FOR SEQ ID NO: 33:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 19 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
CTTCTCTGTC TCCACATGC 19 
(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO
(iv) ANTISENSE: NO

(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34 
TGCACCTGAC TC 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35 
TGCTTACTTA ACCCAGTGTG 


(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
CACACTATGT AATACTATGC 

(2) INFORMATION FOR SEQ ID NO: 37:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37:
GAAAATATCT GACAAACTCA TC

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
CATGGACACC AAATTAAGTT C 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39:
TGAGACTCTG TCTC

(2) INFORMATION FOR SEQ ID NO: 40:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 15 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

TTCCCCAAAT CCCTG 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 19 base pairs

(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44
GATGCCGATG ACCTGCAGAA G 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO
(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45 
CCCTTACCCT TACCCTTACC CTAA 

(2) INFORMATION FOR SEQ ID NO: 46:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO
(iv) ANTISENSE: NO

(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46
AATCCGTGCA GCAGAGTT

(2) INFORMATION FOR SEQ ID NO: 47:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47
TGTCAGAGCT GGACAAGTGT

(2) INFORMATION FOR SEQ ID NO: 48:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO
(iv) ANTISENSE: NO

(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48

GATATTGTCT TCCCGGTAGC 

(2) INFORMATION FOR SEQ ID NO: 49:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iv) ANTISENSE: NO

(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49
CTCGGACCAG GTGTACCGCC 

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50
CCTGTACTGG AAGGCGATCT C

(2) INFORMATION FOR SEQ ID NO: 51:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iv) ANTISENSE: NO
(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51 
CATGAGGCAG AGCATACGCA 

(2) INFORMATION FOR SEQ ID NO: 52:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52
GACAGCAGCA CCGAGACGAT 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53
CGGCTGCGAT CACCGTGCGG

(2) INFORMATION FOR SEQ ID NO: 54:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54
GATCCACTGT GCGACGAGC

(2) INFORMATION FOR SEQ ID NO: 55:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55
GCGGCTGCGA TCACCGTGC

(2) INFORMATION FOR SEQ ID NO: 56:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56 
TGCACCTGAC TC 

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57
CTGTGGTCGT GC 

(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58:
GAGTCAGGTG CGCCATGCCT CAAACAGACA CCATGGCGC 39

(2) INFORMATION FOR SEQ ID NO: 59:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iv) ANTISENSE: NO
(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59:
TCTCTGTCTC CACATGCCCA G 21
(2) INFORMATION FOR SEQ ID NO: 60:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60:

ACCTAGCGTT CAGTTCGACT GAGATAATAC GACTCACTAT AGCAGCTCTC ATTTTCCATA 60 
C 61 

(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61

AACTAAGCCA TGTGCACAAC A 

(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: RNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62 
UCCGGUCUGA UGAGUCCGUG AGGAC 

(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: RNA 
(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63
GUCACUACAG GUGAGCUCCA

(2) INFORMATION FOR SEQ ID NO: 64:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: RNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64
CCAUGCGAGA GUAAGUAGUA

(2) INFORMATION FOR SEQ ID NO: 65:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 50 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: RNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 
AGGCCUGCGG CAAGACGGAA AGACCAUGGU CCCUNAUCUG CCGCAGGAUC 50
(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iv) ANTISENSE: NO

(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66:
CATTTGCTTC TGACACAACT 20 
(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67:
TCTCTGTCTC CACATGCCCA G 21
(2) INFORMATION FOR SEQ ID NO: 68:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 24 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO
(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68
GTCGTCCCAT GGTGCACCTG ACTC 

(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69 
CGCTGTGGTG AGGCCCTGGG CA 

(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iv) ANTISENSE: NO

(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70
GACGACGACT GCTACCTGAC TCCA 

(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71
ACAGCGGACT GCTACCTGAC TCCA

(2) INFORMATION FOR SEQ ID NO: 72:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72 

TGGAGTCAGG TAGCAGTC 

(2) INFORMATION FOR SEQ ID NO: 73:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 13 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO
(iv) ANTISENSE: NO

(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73 
CAGCTCTCAT TTTCCATAC 

(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO
(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74
AGCCCCAAGA TGACTATC 

(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75 
CGAGGAGCTC AAGGCCAGAA T 

(2) INFORMATION FOR SEQ ID NO: 76:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76 
CAGGGGCAGC TCAGCTCTC 

(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77 
GGCACGGCTG TCCAAGGA 

(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78
AGGCCGCGCT CGGCGCCCTC 

(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 
CTTACTTGAA TTCCAAGAGC 

(2) INFORMATION FOR SEQ ID NO: 80:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 
GGGCTGACTT GCATGGACCG GA 

(2) INFORMATION FOR SEQ ID NO: 81:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 12 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 
AGCCAGGACA AG

(2) INFORMATION FOR SEQ ID NO: 82:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82 
ACAGCAGGAA CAGCA 

(2) INFORMATION FOR SEQ ID NO: 83:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83
GCGGACATGG AGGACGTG 

(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84 
GATGCCGATG ACCTGCAGAA G 

(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:85: 
GTGCCCTGCA GCTTCACTGA AGAC 

(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86:
AGCCAGGACA AG 

(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 
AGCCAGGACA AGTC 

(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 




EP 1 164 203 A2 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88:

AGCC^^^ZT.',":^ aga 

(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 
ACAGCACCAA CAGCA 

(2) INFORMATION FOR SEQ ID NO: 90:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 
ACAGCAGGAA CAGCATC 

(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 
ACAGCAGGAA CAGCAG

(2) INFORMATION FOR SEQ ID NO: 92:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92 
GCGGACATGG AGGACGTG 


(2) INFORMATION FOR SEQ ID NO: 93:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93 
GCGGACATGG AGGACGTGGC 

(2) INFORMATION FOR SEQ ID NO: 94:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94 
GCGGACATGG AGGACGTGC 

(2) INFORMATION FOR SEQ ID NO: 95:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95:
GATGCCGATG ACCTGCAGAA G 21 

(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 
GATGCCGATG ACCTGCAGAA GC 22 
(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 
GATGCCGATG ACCTGCAGAA GTG 23 
(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98:

GTGCCCTGCA GCTTCACTGA AGAC 

(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 
GTGCCCTGCA GCTTCACTGA AGACTG 

(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100:
GTGCCCTGCA GCTTCACTGA AGACC 

(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 
TATCTGTTCA CTTGTGCCC

(2) INFORMATION FOR SEQ ID NO: 102:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 19 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102:

CAGAGGCCTG GGGACCCTG 

(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103:
ACGACAGGGC TGGTTGCC 

(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 
ACTGACAACC ACCCTTAAC 


(2) INFORMATION FOR SEQ ID NO: 105:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105:
CTGCTTGCCA CAGGTCTC 

(2) INFORMATION FOR SEQ ID NO: 106:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106:
CACAGCAGGC CAGTGTGC 

(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107:
GGACCTGATT TCCTTACTG 

(2) INFORMATION FOR SEQ ID NO: 108: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 
TGAATCTGAG GCATAACTG 19 


(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 50 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 
TTGCGTACAC ACTGGCCGTC GTTTTACAAC GTCGTGACTG GGAAAACCCT 

(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 
GTAAAACGAC GGCCAGTGTG TACGCAA 

(2) INFORMATION FOR SEQ ID NO: 111: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111:
TACTGaj^^C^G CGATCTCAGC AATCAGC 

(2) INFORMATION FOR SEQ ID NO: 112:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112:
GGCACGGCTG TCCAAGGAG 

(2) INFORMATION FOR SEQ ID NO: 113:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 

AGGCCGCGCT CGGCGCCCTC 

(2) INFORMATION FOR SEQ ID NO: 114: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: RNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114:
GUCACUACAG GUGAGCUCCA 

(2) INFORMATION FOR SEQ ID NO: 115:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115
GAATTCGAGC TCGGTACCCG G 

(2) INFORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116 
CCGGGTACCG AGCTCGAATT C 

(2) INFORMATION FOR SEQ ID NO: 117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117 
CCTCTTGGGA ACTGTGTAGT ATT 

(2) INFORMATION FOR SEQ ID NO: 118: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 112 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118:

AGGCTGTCTC TCTCCCTCTC TCATACACAC ACACACACAC ACACACACAC ACACACACAC 60
ACACACACAC TCACACTCAC CCACANNNAA ATACTACACA GTTCCCAAGA GG 112 

(2) INFORMATION FOR SEQ ID NO: 119:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119:
TAATACGACT CACTATAGGG CGAAGGCTGT CTCTCTCCCT CTCTCATAC 49

(2) INFORMATION FOR SEQ ID NO: 120: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 135 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 

TAATACGACT CACTATAGGG CGAAGGCTGT CTCTCTCCCT CTCTCATACA CACACACACA 60 
CACACACACA CACACACACA CACACACACA CACTCACACT CACCCACANN NAAATACTAC 120 
ACAGTTCCCA AGAGG 135 

(2) INFORMATION FOR SEQ ID NO: 121:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121

AATACT.'.C 3' AG 

(2) INFORMATION FOR SEQ ID NO: 122: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122 
CTGATGCGTC GGATCATCTT TTTT 

(2) INFORMATION FOR SEQ ID NO: 123:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123 
GATGATCCGA CGCATCAGAA TGT 

(2) INFORMATION FOR SEQ ID NO: 124:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124 
GATCTAGCTG GGCCGAGCTA GGCCGTTGA 

(2) INFORMATION FOR SEQ ID NO: 125:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125 
CTGATGCGTC GGATCATCTT TTTTTTT

(2) INFORMATION FOR SEQ ID NO: 126: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126: 
GATGATCCGA CG 

(2) INFORMATION FOR SEQ ID NO: 127: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127: 
GATGATCCGA CGCAT 

(2) INFORMATION FOR SEQ ID NO: 128: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128:
AAAAAAGATG AT 12 
(2) INFORMATION FOR SEQ ID NO: 129: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129:
GATCCGACGC AT 12 
(2) INFORMATION FOR SEQ ID NO: 130: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 253 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130: 

GGCACGGCTG TCCAAGGAGC TGCAGGCGGC GCAGGCCCGG CTGGGCGCGG ACATGGAGGA 60
CGTGTGCGGC CGCCTGGTGC AGTACCGCGG CGAGGTGCAG GCCATGCTCG GCCAGAGCAC 120
CGAGGAGCTG CGGGTGCGCC TCGCCTCCCA CCTGCGCAAG CTGCGTAAGC GGCTCCTCCG 180
CGATGCCGAT GACCTGCAGA AGTGCCTGGC AGTGTACCAG GCCGGGGCCC GCGAGGGCGC 240
CGAGCGCGGC CTC 253

(2) INFORMATION FOR SEQ ID NO: 131: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131:
GAATTAror:?. CCCAACCGCG TGGCACAACA ACTGGCGGGC AAACAGTCGT TGCTGATT 58 

(2) INFORMATION FOR SEQ ID NO: 132:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132: 
ACCATTAAAG AAAATATCAT CTTTGGTGTT TCCTATGATG AATATAGAAG CGTCATC 57 

(2) INFORMATION FOR SEQ ID NO: 133:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 29 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133: 
CTATATTCAT CATAGGAAAC ACCAAAGAT 29 
(2) INFORMATION FOR SEQ ID NO: 134: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134: 
CTATATTCAT CATAGGAAAC ACCAAT 26 

(2) INFORMATION FOR SEQ ID NO: 135:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 29 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135
CTATATTCAT CATAGGAAAC ACCAAAGAT 

(2) INFORMATION FOR SEQ ID NO: 136: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136 
CTATATTCAT CATAGGAAAC ACCAAAGATG ATATTTTC 

(2) INFORMATION FOR SEQ ID NO: 137: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137
CTATATTCAT CATAGGAAAC ACCAATGATA TTTTC

(2) INFORMATION FOR SEQ ID NO: 138:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138:

CTATATTCAT CATAGGAAAC ACCAAAGATA TTTTC 35 

(2) INFORMATION FOR SEQ ID NO: 139:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139: 
CTATATTCAT CATAGGAAAC ACCAAAGATG C 31 


(2) INFORMATION FOR SEQ ID NO: 140:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 200 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140: 


CTTCCACCGC GATGTTGATG ATTATGTGTC TGAATTTGAT GGGGGCAGGC GGGCCCCGTC
TGTTTGTCQC OGGTCTGGTG TTGATGGTGG TTTCCTGCCT TGTCACCCTC GACCTGCAGC
CCAAGCTTGG GATCCACCAC CATCACCATC ACTAATAATG CATGGGCTGC AGCCAATTGG
CACTGGCCGT CGTTTTACAA


(2) INFORMATION FOR SEQ ID NO: 141: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 99 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141:

GTCACCCTCG ACCTGCAGCC CAAGCTTGGG ATCCACCACC ATCACCATCA CTAATAATGC
ATGGGCTGCA GCCAATTGGC ACTGGCCGTC GTTTTACAA

(2) INFORMATION FOR SEQ ID NO: 142:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142:
TGTACGTCAC AACTA 15 
(2) INFORMATION FOR SEQ ID NO: 143: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143:
TGTACGTCAC AACTAC 16 
(2) INFORMATION FOR SEQ ID NO: 144: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144:

TGTACGTCAC AACTACA 17 

(2) INFORMATION FOR SEQ ID NO: 145:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145
TGTACGTCAC AACTACAA 

(2) INFORMATION FOR SEQ ID NO: 146:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 19 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146
TGTACGTCAC AACTACAAT 

(2) INFORMATION FOR SEQ ID NO: 147: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147 
TGTACGTCAC AACTACAATA 

(2) INFORMATION FOR SEQ ID NO: 148: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148: 

TGTACGTCAC AACTACAATA G 21 

(2) INFORMATION FOR SEQ ID NO: 149: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149: 
TGTACGTCAC AACTACAATA GG 22 

(2) INFORMATION FOR SEQ ID NO: 150:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 23 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150: 
TGTACGTCAC AACTACAATA GGC 23 

(2) INFORMATION FOR SEQ ID NO: 151:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151: 
TGTACGTCAC AACTACAATA GGCC 24

(2) INFORMATION FOR SEQ ID NO: 152:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 25 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152

TGTACGTCAC AACTACAATA GGCCC

(2) INFORMATION FOR SEQ ID NO: 153:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153 
TGTACGTCAC AACTACAATA GGCCCT 

(2) INFORMATION FOR SEQ ID NO: 154: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154 
TGTACGTCAC AACTACAATA GGCCCTG 

(2) INFORMATION FOR SEQ ID NO: 155: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155
TGTACGTCAC AACTACAATA GGCCCTGC 

(2) INFORMATION FOR SEQ ID NO: 156: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156 
TGTACGTCAC AACTACAATA GGCCCTGCA 

(2) INFORMATION FOR SEQ ID NO: 157:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157 
TGTACGTCAC AACTACAATA GGCCCTGCAC 

(2) INFORMATION FOR SEQ ID NO: 158: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158
TGTACGTCAC AACTACAATA GGCCCTGCAC C

(2) INFORMATION FOR SEQ ID NO: 159:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159:

TGTACGTCAC AACTACAATA GGCCCTGCAC CA

(2) INFORMATION FOR SEQ ID NO: 160:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160: 
TGTACGTCAC AACTACAATA GGCCCTGCAC CAG 

(2) INFORMATION FOR SEQ ID NO: 161:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161:
TGTACGTCAC AACTACAATA GGCCCTGCAC CAGG

(2) INFORMATION FOR SEQ ID NO: 162:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 35 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162
TGTACGTCAC AACTACAATA GGCCCTGCAC CAGGC

(2) INFORMATION FOR SEQ ID NO: 163:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 36 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163:
TGTACGTCAC AACTACAATA GGCCCTGCAC CAGGCC 

(2) INFORMATION FOR SEQ ID NO: 164: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164 
TGTACGTCAC AACTACAATA GGCCCTGCAC CAGGCCA 

(2) INFORMATION FOR SEQ ID NO: 165:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165:
TGTACGTCAC AACTACAATA GGCCCTGCAC CAGGCCAG

(2) INFORMATION FOR SEQ ID NO: 166:

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 39 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166 
TGTACGTCAC AACTACAATA GGCCCTGCAC CAGGCCAGA 
(2) INFORMATION FOR SEQ ID NO: 167:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 19 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167 
CTGATGCGTC GGATCATCC 

(2) INFORMATION FOR SEQ ID NO: 168: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO
(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168 
CTGATGCGTC GGATCATCCA 

(2) INFORMATION FOR SEQ ID NO: 169:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 169:
CTGATGCGTC GGATCATCCA G 

(2) INFORMATION FOR SEQ ID NO: 170:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170 
CTGATGCGTC GGATCATCCA GC 

(2) INFORMATION FOR SEQ ID NO: 171: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171:
CTGATGCGTC GGATCATCCA GCA 

(2) INFORMATION FOR SEQ ID NO: 172: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO
(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172 
CTGATGCGTC GGATCATCCA GCAG 

(2) INFORMATION FOR SEQ ID NO: 173: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173 
CTGATGCGTC GGATCATCCA GCAGC 

(2) INFORMATION FOR SEQ ID NO: 174: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174:
CTGATGCGTC GGATCATCCA GCAGCA 

(2) INFORMATION FOR SEQ ID NO: 175: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175:
CTGATGCGTC GGATCATCCA GCAGCAG

(2) INFORMATION FOR SEQ ID NO: 176:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176:
CTGATGCGTC GGATCATCCA GCAGCAGC 

(2) INFORMATION FOR SEQ ID NO: 177:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 29 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 177: 
CTGATGCGTC GGATCATCCA GCAGCAGCA 

(2) INFORMATION FOR SEQ ID NO: 178:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 178:
CTGATGCGTC GGATCATCCA GCAGCAGCAG 

(2) INFORMATION FOR SEQ ID NO: 179:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs

(B) TYPE: nucleic acid


(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179:
CTGATGCGTC GGATCATCCA GCAGCAGCAG C 

(2) INFORMATION FOR SEQ ID NO: 180:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO
(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CA 

(2) INFORMATION FOR SEQ ID NO: 181: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181:
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAG 

(2) INFORMATION FOR SEQ ID NO: 182: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 182:
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGC 

(2) INFORMATION FOR SEQ ID NO: 183: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 183:
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCA 

(2) INFORMATION FOR SEQ ID NO: 184: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 184 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAG 

(2) INFORMATION FOR SEQ ID NO: 185: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 185:
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGC 

(2) INFORMATION FOR SEQ ID NO: 186:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 38 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 186 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCA 

(2) INFORMATION FOR SEQ ID NO: 187: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187:
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAG
(2) INFORMATION FOR SEQ ID NO: 188:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 188:
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC
(2) INFORMATION FOR SEQ ID NO: 189:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO
(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 189:
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC A 
(2) INFORMATION FOR SEQ ID NO: 190: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 42 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 190:
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AG 
(2) INFORMATION FOR SEQ ID NO: 191: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGT 
(2) INFORMATION FOR SEQ ID NO: 192: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO
(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192:
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTC
(2) INFORMATION FOR SEQ ID NO: 193:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 193: 

CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCA 

(2) INFORMATION FOR SEQ ID NO: 194: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194:
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCAC 
(2) INFORMATION FOR SEQ ID NO: 195: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 47 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 195:
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACG
(2) INFORMATION FOR SEQ ID NO: 196:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 196:
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGC 
(2) INFORMATION FOR SEQ ID NO: 197: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197:
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCT 
(2) INFORMATION FOR SEQ ID NO: 198: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 50 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198:
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA 
(2) INFORMATION FOR SEQ ID NO: 199: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 199:

CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA A 

(2) INFORMATION FOR SEQ ID NO: 200:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 52 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 200:
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA AC 
(2) INFORMATION FOR SEQ ID NO: 201: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:201: 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACC 
(2) INFORMATION FOR SEQ ID NO: 202: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 202:
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCG 
(2) INFORMATION FOR SEQ ID NO: 203:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 55 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO
(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 203:
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGA 55
(2) INFORMATION FOR SEQ ID NO: 204:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 56 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 204:
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAA 56
(2) INFORMATION FOR SEQ ID NO: 205:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 205:
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAAT 57
(2) INFORMATION FOR SEQ ID NO: 206:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 206:

CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAATC 58 
(2) INFORMATION FOR SEQ ID NO: 207:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 207:
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAATCC 59 
(2) INFORMATION FOR SEQ ID NO: 208: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 60 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 208:
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAATCCC 60
(2) INFORMATION FOR SEQ ID NO: 209: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 209:
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAATCCC 60
T 61


(2) INFORMATION FOR SEQ ID NO: 210: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 62 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 210:

CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAATCCC 60 
TG 62 

(2) INFORMATION FOR SEQ ID NO: 211: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 211:

CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAATCCC 60
TGG 63 

(2) INFORMATION FOR SEQ ID NO: 212:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 64 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 212:

CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAATCCC 60
TGGT 64 

(2) INFORMATION FOR SEQ ID NO: 213:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 65 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:213: 

CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAATCCC 60 
TGGTC 65 

(2) INFORMATION FOR SEQ ID NO: 214:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 214: 

CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAATCCC 60 
TGGTCA 66 

(2) INFORMATION FOR SEQ ID NO: 215: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 67 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 215: 

CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAATCCC 60 
TGGTCAG 67 

(2) INFORMATION FOR SEQ ID NO: 216:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 216: 

CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAATCCC 60 
TGGTCAGA 68

(2) INFORMATION FOR SEQ ID NO: 217:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 69 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:217: 

CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAATCCC 60 
TGGTCAGAT 69

(2) INFORMATION FOR SEQ ID NO: 218:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 218:

CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAATCCC 60 
TGGTCAGATC 70 

(2) INFORMATION FOR SEQ ID NO: 219: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO
(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 219: 

CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAATCCC 
TGGTCAGATC T 

(2) INFORMATION FOR SEQ ID NO: 220:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 220:

CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAATCCC 
TGGTCAGATC TT 

(2) INFORMATION FOR SEQ ID NO: 221: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 13 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 221:
TGCACCTGAC TCC 

(2) INFORMATION FOR SEQ ID NO: 222:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 222:
TGCACCTGAC TCCT

(2) INFORMATION FOR SEQ ID NO: 223: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 15 base pairs 
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iv) ANTISENSE: NO
(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 223:
TGCACCTGAC TCCTG 

(2) INFORMATION FOR SEQ ID NO: 224: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 224: 
TGCACCTGAC TCCTGT 

(2) INFORMATION FOR SEQ ID NO: 225: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 225:
TGCACCTGAC TCCTGTG 

(2) INFORMATION FOR SEQ ID NO: 226: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 226 
TGCACCTGAC TCCTGTGG 

(2) INFORMATION FOR SEQ ID NO: 227:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 19 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 227:
TGCACCTGAC TCCTGTGGA 

(2) INFORMATION FOR SEQ ID NO: 228:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 228 
TGCACCTGAC TCCTGTGGAG 

(2) INFORMATION FOR SEQ ID NO: 229:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 229:
TGCACCTGAC TCCTGTGGAG A 

(2) INFORMATION FOR SEQ ID NO: 230: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 230:
TGCACCTGAC TCCTGTGGAG AA 

(2) INFORMATION FOR SEQ ID NO: 231: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231:
TGCACCTGAC TCCTGTGGAG AAG 

(2) INFORMATION FOR SEQ ID NO: 232:

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 232:
TGCACCTGAC TCCTGTGGAG AAGT

(2) INFORMATION FOR SEQ ID NO: 233:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 25 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:233 
TGCACCTGAC TCCTGTGGAG AAGTC 

(2) INFORMATION FOR SEQ ID NO: 234:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 234:
TGCACCTGAC TCCTGTGGAG AAGTCT 

(2) INFORMATION FOR SEQ ID NO: 235: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235:
TGCACCTGAC TCCTGTGGAG AAGTCTG

(2) INFORMATION FOR SEQ ID NO: 236:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iv) ANTISENSE: NO
(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 236 
TGCACCTGAC TCCTGTGGAG AAGTCTGC 

(2) INFORMATION FOR SEQ ID NO: 237: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO
(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 237 
TGCACCTGAC TCCTGTGGAG AAGTCTGCC 

(2) INFORMATION FOR SEQ ID NO: 238:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 238:
TGCACCTGAC TCCTGTGGAG AAGTCTGCCG

(2) INFORMATION FOR SEQ ID NO: 239:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 239:
TGCACCTGAC TCCTGTGGAG AAGTCTGCCG T

(2) INFORMATION FOR SEQ ID NO: 240: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO
(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 240:
TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TT 

(2) INFORMATION FOR SEQ ID NO: 241: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 241: 
TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTA 

(2) INFORMATION FOR SEQ ID NO: 242:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 242:
TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTAC 

(2) INFORMATION FOR SEQ ID NO: 243: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 243 
TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACT 

(2) INFORMATION FOR SEQ ID NO: 244: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 36 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO
(iv) ANTISENSE: NO

(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244
TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTG 

(2) INFORMATION FOR SEQ ID NO: 245:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 245
TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGC 

(2) INFORMATION FOR SEQ ID NO: 246:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 38 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 246 
TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCC 

(2) INFORMATION FOR SEQ ID NO: 247: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO

(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 247

TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCCC 

(2) INFORMATION FOR SEQ ID NO: 248: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 40 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 248
TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCCCT 
(2) INFORMATION FOR SEQ ID NO: 249:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 249 
TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCCCT G 
(2) INFORMATION FOR SEQ ID NO: 250:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 250:
TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCCCT GT 42 
(2) INFORMATION FOR SEQ ID NO: 251: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 251:
TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCCCT GTG 43 
(2) INFORMATION FOR SEQ ID NO: 252: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 252: 
TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCCCT GTGG 44 
(2) INFORMATION FOR SEQ ID NO: 253:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO 
(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 253: 
TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCCCT GTGGG 
(2) INFORMATION FOR SEQ ID NO: 254: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 254:
TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCCCT GTGGGG 
(2) INFORMATION FOR SEQ ID NO: 255: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO 
(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255:
TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCCCT GTGGGGC 
(2) INFORMATION FOR SEQ ID NO: 256: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 256:
TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCCCT GTGGGGCA
(2) INFORMATION FOR SEQ ID NO: 257:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 257: 
TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCCCT GTGGGGCAA 
(2) INFORMATION FOR SEQ ID NO: 258:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 258: 
TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCCCT GTGGGGCAAG 
(2) INFORMATION FOR SEQ ID NO: 259: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 259:

TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCCCT GTGGGGCAAG G 

(2) INFORMATION FOR SEQ ID NO: 260:

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 260: 

TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCCCT GTGGGGCAAG GT 52 

(2) INFORMATION FOR SEQ ID NO: 261:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 209 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO
(iv) ANTISENSE: NO
(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 261:

CATTTGCTTC TGACACAACT GTGTTCACTA GCAACCTCAA ACAGACACCA TGGTGCACCT 60 
GACTCCTGTG GAGAAGTCTG CCGTTACTGC CCTGTGGGGC AAGGTGAACG TGGATGAAGT 120 
TGGTGGTGAG GCCCTGGGCA GGTTGGTATC AAGGTTACAA GACAGGTTTA AGGAGACCAA 180
TAGAAACTGG GCATGTGGAG ACAGAGAAG 209 

(2) INFORMATION FOR SEQ ID NO: 262:

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 88 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 262:

TGAGACTCTG TCTCAAAAAT AAATAAATAA ATAAATAAAT AAATAAATAA ATAAATAAAT 60 
AAATAAATAA GTAAAAAAGA AAGAATGC 88 

(2) INFORMATION FOR SEQ ID NO: 263:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO
(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 263:
GTGTGTGTGT GTGTGTGTT TTTTTTTAAC AGGGATTTGG GGAATTATTT GAGA 54 
(2) INFORMATION FOR SEQ ID NO: 264: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 264: 
TTCCCCAAAT CCCTGTTAAA AAC 23 

(2) INFORMATION FOR SEQ ID NO: 265:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 265: 
TTCCCCAAAT CCCTGTTAAA AAAAC 25
(2) INFORMATION FOR SEQ ID NO: 266:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 266:
TTCCCCAAAT CCCTGTTAAA AAAAAAC 27 
(2) INFORMATION FOR SEQ ID NO: 267:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 267:

GTAAAACGAC CGCCAGTGCC AAGCTTGCAT GCCTGCAGGT CGACTCTAGA GGATCCCCGG 60 
GTACCGAGCT CGAATTCGTA ATCATGGTCA TAGCTGTTTC CTG 103 


(2) INFORMATION FOR SEQ ID NO: 268:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 268: 

GAGTCAGGTG CGCCATGCCT CAAACAGACA CCATGGTGCA CCTGACTCCT GAGGAGNCTG 60 
GGCATGTGGA GACAGAGA 78


(2) INFORMATION FOR SEQ ID NO: 269:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 269:
TCTCTGTCTC CACATGCCCA GNCTCCTCAG GACTCAGGTG CACATGGTGT CTGTTTGAGG 60
CATGGCGCAC CTGAGCTC 78
(2) INFORMATION FOR SEQ ID NO: 270: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 78 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:270: 

TCTCTGTCTC CACATGCCCA GNCTCCTCAG GAGTCAGGTG CGCCATGGTG TCTGTTTGAG 60 
GCATGGCGCA CGTGACTC 78 

(2) INFORMATION FOR SEQ ID NO: 271:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 82 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO
(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 271:

TCTCTGTCTC CACATGCCCA GNCTCCTCAG GAGTCAGGTG CGCCATGGTG TCTGTTTGAG 60 
GCATGGCGCA CCTGACTCCT GA 82 

(2) INFORMATION FOR SEQ ID NO: 272: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 272: 
TCTCTGTCTC CACATGCCCA GNCTCCTCAG GAGTCAGGTG CG 42 
(2) INFORMATION FOR SEQ ID NO: 273: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 13 base pairs

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 273:
CACCTGACTC CTA 

(2) INFORMATION FOR SEQ ID NO: 274:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 274: 
CACCTGACTC CTGGA 

(2) INFORMATION FOR SEQ ID NO: 275:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 275: 
CACCTGACTC CTGA 

(2) INFORMATION FOR SEQ ID NO: 276: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO
(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 276:
CCATGGTGTC TGTTTGAGGC ATGGCG 26 
(2) INFORMATION FOR SEQ ID NO: 277: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:277: 

CAGCTCTCAT TTTCCATACA GTCAGTATCA ATTCTGGAAG AATTTCCAGA CATTAAAGAT 60 
AGTCATCTTG GGGCT 75 

(2) INFORMATION FOR SEQ ID NO: 278:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 61 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 278: 

ACCTAGCGTT CAGTTCGACT GAGATAATAC GACTCACTAT AGCAGCTCTC ATTTTCCATA 60 
C 61 

(2) INFORMATION FOR SEQ ID NO: 279:

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: RNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 279:
GDCACUACAG GUGAGCUCCA

(2) INFORMATION FOR SEQ ID NO: 280:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 280:
CTCAGTCCAC GTGGTACCCT GCTG

(2) INFORMATION FOR SEQ ID NO: 281: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 85 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 281: 

CATTTGCTTC TGACACAACT GTGTTCACTA GCAACCTCAA ACAGACACCA TGGTGCACCT 
GACTCCTGAG GAGAAGTCTG CCGTT 85 


(2) INFORMATION FOR SEQ ID NO: 282:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 282:
ACGGGTCCCG GAGTGGTGTC GC 22
(2) INFORMATION FOR SEQ ID NO: 283:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 76 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:283: 

ACTGCCCTGT GGGGCAAGGT GAACGTGGAT GAAGTTGGTG GTGAGGCCCT GGGCAGGTTG 60
GTATCAAGGT TACAAG 76 

(2) INFORMATION FOR SEQ ID NO: 284:

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 76 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 284:

ACTGCCCTGT GGGGCAAGGT GAACGTGGAT GAAGTTGGTG GTGAGGCCCT GGGCAGATTG 60 
GTATCAAGGT TACAAG 76 

(2) INFORMATION FOR SEQ ID NO: 285: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 76 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 285: 

ACTGCCCTGT GGGGCAAGGT GAACGTGGAT GAAGTTGGTG GTGAGGCCCT GGGCAGGTTG 60
GTATCAAGGT TACAAG 76 

(2) INFORMATION FOR SEQ ID NO: 286: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 76 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 286:

ACTGCCCTGT GGGGCAAGGT GAACGTGGAT GAAGTTGGTG GTGAGGCCCT GGGCAGGTTG 60 
GCATCAAGGT TACAAG 76 


(2) INFORMATION FOR SEQ ID NO: 287: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 48 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 287:
ACAGGTTTAA GGAGACCAAT AGAAACTGGG CATGTGGAGA CAGAGAAG 48
(2) INFORMATION FOR SEQ ID NO: 288:


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE: 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 288: 
GACGACGACT GCTACCTGAC TCCA 


(2) INFORMATION FOR SEQ ID NO: 289:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 


(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 289: 

ACAGCGCACT GCTACCTGAC TCCA 24 

(2) INFORMATION FOR SEQ ID NO: 290: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 290:
TGGAGTCAGG TAGCAGTC 18
(2) INFORMATION FOR SEQ ID NO: 291:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO
(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 291:
CAGCTCTCAT TTTCCATACA GTCAGTATCA ATTCTGGAAG AATTTCCAGA CATTAAAGAT 60 

(2) INFORMATION FOR SEQ ID NO: 292:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 65 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 292:

AGTCATCTTG GGGCTGTCGA GAGTAAAAGG TATGTCAGTC ATAGTTAAGA CCTTCTTAAA
GGTCT 65
(2) INFORMATION FOR SEQ ID NO: 293:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 25 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 293:
GTAATTTCTA TCAGTAGAAC CCCGA 25
(2) INFORMATION FOR SEQ ID NO: 294:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 294:
CAGCTCTCAT TTTCCATACA GTCAGTATCA ATTCTGGAAG AATTTCCAGA CATTAAAGAT 60
(2) INFORMATION FOR SEQ ID NO: 295:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 295: 
AGTCATCTTG GGGCT 15 
(2) INFORMATION FOR SEQ ID NO: 296:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO
(iv) ANTISENSE: NO

(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 296: 
CAGCTCTCAT TTTCCATACA GTCAGTATCA ATTCTGGAAG AATTTCCAGA CATTAAAGAT 60 
(2) INFORMATION FOR SEQ ID NO: 297:

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 297: 
AGTCATCTTG GGGCTA 16 
(2) INFORMATION FOR SEQ ID NO: 298:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 298: 
CAGCTCTCAT TTTCCATACA TTAAAGATAG TCATCTTGGG GCT 43 
(2) INFORMATION FOR SEQ ID NO: 299:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 299:
CAGCTCTCAT TTTCCATACA TTAAAGATAG TCATCTTGGG GCTA
(2) INFORMATION FOR SEQ ID NO: 300:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 300:
CAGCTCTCAT TTTCCATACA GT 

(2) INFORMATION FOR SEQ ID NO: 301: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:301: 
CAGCTCTCAT TTTCCATACA T 

(2) INFORMATION FOR SEQ ID NO: 302: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 302:
GCCTGGTACA CTGCCAGGCG CTTCTGCAGG TCATCGGCAT CGCGGAGGAG
(2) INFORMATION FOR SEQ ID NO: 303:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 303:
GCCTGGTACA CTGCCAGGCA CTTCTGCAGG TCATCGGCAT CGCGGAGGAG
(2) INFORMATION FOR SEQ ID NO: 304:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 304:
GATGCCGATG ACCTGCAGAA G 


(2) INFORMATION FOR SEQ ID NO: 305: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 305: 
GATGCCGATG ACCTGCAGAA GC 

(2) INFORMATION FOR SEQ ID NO: 306:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 306:

GATGCCGATG ACCTGCAGAA GTGC 

(2) INFORMATION FOR SEQ ID NO: 307: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 307: 
GATGATCCGA CG 

(2) INFORMATION FOR SEQ ID NO: 308: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 308:
CTGATGCGTC GGATCATC 

(2) INFORMATION FOR SEQ ID NO: 309: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 309: 
GATGATCCGA CG 
(2) INFORMATION FOR SEQ ID NO: 310:

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 35 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 310 
GGCGCGGACA TGGAGGACGT GTGCGGCCGC CTGGT 

(2) INFORMATION FOR SEQ ID NO: 311: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 311 
TCCGCGATGC CGATGACCTG CAGAAGCGCC TGGC 

(2) INFORMATION FOR SEQ ID NO: 312: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 312
CGGCTGCGAT CAGCGTGCGG CACAGCT

(2) INFORMATION FOR SEQ ID NO: 313:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 313

CGGCTGCGAT CACCGTGCGG T 

(2) INFORMATION FOR SEQ ID NO: 314: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 27 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 314 
CGGCTGCGAT CACCGTGCGG AACAGCT 

(2) INFORMATION FOR SEQ ID NO: 315:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 315
CGGCTGCGAT CACCGTGCGG CA

(2) INFORMATION FOR SEQ ID NO: 316:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 316
CGGCTGCGAT CACCGTGCGG TA

(2) INFORMATION FOR SEQ ID NO: 317:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO

(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 317 
CGGCTGCGAT CACCGTGCGG A 

(2) INFORMATION FOR SEQ ID NO: 318: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 318
ATCATCAACT GGAAGATCAG GTCAGGAGCC ACTTGCCANC CT
(2) INFORMATION FOR SEQ ID NO: 319:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 319 
ATCATCACAC TGGAAGACTC CAGGTCAGGA GCC 

(2) INFORMATION FOR SEQ ID NO: 320: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 48 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown


(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:320: 
ATCCACTACA ACTACATGTG TAACAGTTGG wGCwwGCC 48 


Claims 


1. A process for determining the order of base specifically terminated nucleic acid fragments of a target nucleic acid 
molecule, comprising the steps of: 

a) obtaining a nucleic acid molecule, comprising the target nucleic acid sequence and, at one end, a tag; 

b) generating base specifically terminated nucleic acid fragments from the target nucleic acid; and

c) analyzing the fragments by a mass spectrometry format, thereby determining the order of the base specifically
terminated nucleic acid fragments in the target nucleic acid molecule.

2. A process of claim 1, wherein in step b), a nuclease is contacted with the target nucleic acid to generate the base
specifically terminated nucleic acid fragments. 

3. A process of claim 2, wherein the nuclease is a restriction enzyme that can recognize and cleave at least one 
restriction site in the target nucleic acid. 

4. A process of claim 2, wherein the target nucleic acid is a deoxyribonucleic acid and the nuclease is a deoxyribo- 
nuclease. 

5. A process of claim 2, wherein the target nucleic acid is a ribonucleic acid and the nuclease is a ribonuclease. 

6. A process of claim 5, wherein the ribonuclease is selected from the group consisting of: the G-specific T1 ribonuclease,
the A-specific U2 ribonuclease, the A/U specific PhyM ribonuclease, the U/C specific ribonuclease A, the
C-specific chicken liver ribonuclease and crisavitin.

7. A process of claim 1, wherein in step b), the base specifically terminated nucleic acid fragments are generated by
performance of a combined amplification and base-specific termination reaction.

8. A process of claim 7, wherein the combined amplification and base-specific termination reaction is performed using 
a first polymerase, which has a relatively low affinity towards at least one chain terminating nucleotide, and a 
second polymerase, which has a relatively high affinity towards at least one chain terminating nucleotide. 

9. A process of claim 8, wherein the first and second polymerases are thermostable DNA polymerases. 


10. A process of claim 9, wherein the thermostable DNA polymerases are selected from the group consisting of: Taq
DNA polymerase, AmpliTaq FS DNA polymerase, Deep Vent (exo-) DNA polymerase, Vent DNA polymerase, Vent
(exo-) DNA polymerase, Vent DNA polymerase, Vent (exo-) DNA polymerase, Deep Vent DNA polymerase, Thermo
Sequenase, exo(-) Pseudococcus furiosus (Pfu) DNA polymerase, AmpliTaq, Ultman, 9 degree Nm, Tth, Hot Tub,
Pyrococcus furiosus (Pfu) and Pyrococcus woesei (Pwo) DNA polymerase.

11. A process of claim 1, wherein the base specifically terminated nucleic acid fragments generated in step b) include
mass modified nucleotides. 
12. A process of claim 1, wherein the tag comprises a 3' tag.

13. A process of claim 1, wherein the tag comprises a 5' tag.

14. A process of claim 12 or 13, wherein the tag is a non-natural tag.

15. A process of claim 14, wherein the non-natural tag is selected from the group consisting of: an affinity tag and a 
mass marker. 

16. A process of claim 15, wherein the affinity tag facilitates immobilization of the nucleic acid to a solid support.

17. A process of claim 16, wherein the affinity tag is biotin or a nucleic acid sequence that is capable of binding to a 
capture nucleic acid sequence that is bound to a solid support. 

18. A method for detecting neoplasia/malignancies in a tissue or cell sample, comprising detecting telomerase activity,
mutation of a proto-oncogene, or expression of a tumor specific gene in the sample by detecting nucleic acids that
encode the telomerase, that are specific for the mutation or that encode the tumor-specific gene by mass spectrometry.

19. The method of claim 18 that is a method for detecting a neoplasia/malignancy in a tissue or cell sample, comprising:

a) isolating telomerase from the sample and adding a synthetic DNA primer, which is optionally immobilized, 
complementary to a telomeric repeat, and all four deoxynucleotide triphosphates under conditions that result 
in telomerase specific extension of the synthetic DNA; 

b) amplifying the telomerase extended DNA product; and 

c) detecting the DNA product by mass spectrometry, wherein telomerase-specific extension is indicative of
neoplasia/malignancy.

20. The method of claim 19, wherein the primer contains a linker moiety for immobilization on a support; and the 
amplified primers are isolated by conjugating the linker portion to a solid support. 

21. A process for detecting telomerase activity in a biological sample, comprising the steps of:

a) incubating the biological sample; a substrate primer, which can be extended by telomerase activity; and a 
complete set of deoxy nucleoside triphosphates; and 
b) detecting a telomerase extended substrate primer by mass spectrometry, thereby detecting telomerase
activity in the biological sample.
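
To illustrate how an extended substrate primer might be recognized by mass, the following Python sketch models extension as the addition of whole telomeric repeats, so that products fall on a ladder of evenly spaced masses above the unextended primer. This is a simplified assumption. The repeat TTAGGG is the human telomeric repeat and is not named in the claim; the residue masses, the tolerance and all function names are likewise illustrative.

```python
# Approximate average masses (Da) of nucleotide residues within a DNA chain.
# Assumed textbook values, not figures stated in the claims.
RESIDUE_MASS = {"A": 313.21, "C": 289.18, "G": 329.21, "T": 304.20}

# Human telomeric repeat; an assumption, the claim does not name a sequence.
TELOMERIC_REPEAT = "TTAGGG"


def repeat_mass(repeat=TELOMERIC_REPEAT):
    """Mass added to the primer by one complete repeat."""
    return sum(RESIDUE_MASS[base] for base in repeat)


def repeats_added(observed_mass, primer_mass, tolerance=2.0):
    """Return the number of whole repeats added, or None if no repeat fits."""
    step = repeat_mass()
    n = round((observed_mass - primer_mass) / step)
    if n >= 1 and abs(primer_mass + n * step - observed_mass) <= tolerance:
        return n
    return None


# Hypothetical example: a 5000.0 Da primer carrying three added repeats.
print(repeats_added(5000.0 + 3 * repeat_mass(), 5000.0))
```

A peak at the unextended primer mass returns `None` here, which corresponds to no detected telomerase-specific extension under this simplified model.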

22. The process of claim 21, wherein the substrate primer is immobilized to a solid support.

23. The process of claim 22, wherein the substrate primer is immobilized in an array on the solid support.

24. The process of claim 21, further comprising amplifying the telomerase extended substrate primer prior to mass
spectrometry. 

25. The method of claim 18 that is a method for identifying cells or tissues transformed by a mutant proto-oncogene,
comprising: 

a) in a cell or tissue sample, amplifying a portion of a proto-oncogene that includes a codon indicative of 
transformation, wherein one primer comprises a linker moiety for immobilization; 
b) immobilizing DNA via the linker moiety to a solid support, optionally in the form of an array;

c) hybridizing a primer complementary to the proto-oncogene sequence that is upstream from the codon;

d) adding 3 dNTPs/1 ddNTP and DNA polymerase and extending the hybridized primer to the next ddNTP
location; 

e) ionizing/volatizing the sample; and 

f) detecting the mass of an extended DNA indicative of a mutant proto-oncogene, thereby identifying cells or
tissues transformed by a mutant proto-oncogene.
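
Steps d) to f) can be sketched as follows: with three dNTPs and one ddNTP present, the primer is extended until the first position that calls for the dideoxy base, so a base change near the primer can shift the stop position and therefore the product mass. The Python below is a minimal illustration under stated assumptions. The two six-base sequences are hypothetical and are not RET sequence; the residue masses and the 16 Da dideoxy deficit are approximate assumed values.

```python
# Approximate average masses (Da) of nucleotide residues within a DNA chain.
# Assumed textbook values, not figures stated in the claims.
RESIDUE_MASS = {"A": 313.21, "C": 289.18, "G": 329.21, "T": 304.20}

# A dideoxynucleotide lacks one oxygen relative to the deoxynucleotide.
DIDEOXY_DEFICIT = 16.00


def extension_product(bases_to_add, dd_base):
    """Bases added to the primer; extension stops after the first dd_base."""
    added = []
    for base in bases_to_add:
        added.append(base)
        if base == dd_base:
            break  # the incorporated ddNTP terminates the chain
    return "".join(added)


def added_mass(product, dd_base):
    """Mass the extension adds to the primer."""
    mass = sum(RESIDUE_MASS[base] for base in product)
    if product.endswith(dd_base):
        mass -= DIDEOXY_DEFICIT
    return mass


# Hypothetical wild-type and mutant stretches downstream of the primer,
# written as the bases the polymerase would add, with ddGTP as terminator.
wild_type, mutant = "CTGCAG", "CTACAG"
for name, seq in (("wild type", wild_type), ("mutant", mutant)):
    product = extension_product(seq, "G")
    print(name, product, round(added_mass(product, "G"), 2))
```

In this example the wild-type product stops after three bases and the mutant after six, so the two extended primers differ by several hundred daltons.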

26. The method of claim 25, wherein the proto-oncogene is the RET proto-oncogene. 
27. The method of claim 26, wherein the codon indicative of transformation is codon 634 of the RET proto-oncogene.

28. The method of claim 18 that is a method for detecting expression of a tumor-specific gene, comprising: 

a) isolating polyA RNA from the sample;

b) preparing a cDNA library using reverse transcription;

c) amplifying a cDNA product, or portion thereof, of the tumor-specific gene with a set of primers, wherein one
primer of the set of primers comprises a linker moiety;

d) isolating the amplified product by immobilizing the DNA to a solid support via the linker moiety;

e) optionally conditioning the DNA; and

f) ionizing/volatizing the sample and detecting the presence of a DNA peak that is indicative of expression of the
gene.

29. The method of claim 28, wherein the cells are bone marrow cells, the gene is the tyrosine hydroxylase gene, and
expression of the gene is indicative of neuroblastoma.

30. A method for directly detecting a double-stranded nucleic acid using matrix-assisted laser desorption/ionization
time-of-flight (MALDI-TOF) mass spectrometry, comprising:

a) isolating a double-stranded DNA fragment from a cell or tissue sample;

b) preparing the double-stranded DNA for analysis under conditions that increase the ratio of dsDNA:ssDNA,
wherein the conditions include one or both of the following: preparing samples for analysis at a temperature
of about 4°C or less, and using a high concentration of DNA in the matrix to drive duplex formation;

c) ionizing/volatizing the DNA of step b), using a low acceleration voltage; and

d) detecting the presence of the double-stranded DNA by MALDI-TOF mass spectrometry.


31. A method for comparing DNA samples to discern relatedness or to detect mutations, comprising:

a) obtaining a plurality of biological samples;

b) amplifying a region of DNA from each sample that contains two or more microsatellite DNA repeat sequences;

c) detecting the presence of the amplified DNA from each sample by mass spectrometry and comparing the 
molecular weights of the amplified DNA, wherein different sizes are indicative of non-identity or mutations 
between or among the samples. 
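
The comparison in step c) can be illustrated by expressing the mass difference between two amplified products as a number of repeat units. The Python sketch below assumes single-stranded masses, approximate average residue masses and a hypothetical CA dinucleotide repeat; the function names and the tolerance are illustrative and not taken from the claims.

```python
# Approximate average masses (Da) of nucleotide residues within a DNA chain.
# Assumed textbook values, not figures stated in the claims.
RESIDUE_MASS = {"A": 313.21, "C": 289.18, "G": 329.21, "T": 304.20}


def repeat_unit_mass(unit):
    """Mass of one microsatellite repeat unit on a single strand."""
    return sum(RESIDUE_MASS[base] for base in unit)


def repeat_count_difference(mass_a, mass_b, unit, tolerance=2.0):
    """Repeat units by which product B exceeds product A.

    Returns None when the mass difference is not a whole number of units,
    which would point to some change other than a repeat-length difference.
    """
    step = repeat_unit_mass(unit)
    diff = mass_b - mass_a
    n = round(diff / step)
    if abs(diff - n * step) > tolerance:
        return None
    return n


# Hypothetical example: two products differing by two CA repeats.
print(repeat_count_difference(20000.0, 20000.0 + 2 * repeat_unit_mass("CA"), "CA"))
```

A result of zero corresponds to products of the same size, and any non-zero result to the different sizes that the claim treats as indicating non-identity or mutation.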

32. The method of claim 31, wherein non-identity is indicative of the presence of a mutation in the DNA in one sample,
of non-relatedness, or of non-HLA compatibility between or among the individuals from whom the samples were
obtained.

33. The method of claim 31 or 32, wherein a plurality of markers are examined simultaneously.

34. A method for identifying a target nucleotide in a nucleic acid sequence, comprising:

a) amplifying at least a portion of the nucleic acid sequence comprising the target nucleotide using:

(i) a first primer, wherein:

the 5'-end of the primer shares identity to a portion of the nucleic acid sequence immediately downstream
from the target nucleotide followed by a sequence encoding a unique restriction endonuclease
site, and

the 3'-end of the primer is self-complementary; and

(ii) a second downstream primer that contains a linker moiety,

thereby producing an amplified double stranded nucleic acid comprising at least a portion of the
nucleic acid sequence comprising the target nucleotide;

b) immobilizing the amplified double-stranded nucleic acid to a solid support via the linker moiety; 

c) denaturing the immobilized nucleic acid and isolating the non-immobilized strand; 
d) annealing the intracomplementary sequences in the 3'-end of the isolated non-immobilized strand, such
that the 3'-end is extendable by a polymerase, thereby producing a self-annealed nucleic acid; 

f) extending the self-annealed nucleic acid by incubation with a polymerase, 3 dNTPs and 1 ddNTP, which 
corresponds to the missing dNTP, thereby producing an extended nucleic acid; 

g) cleaving the extended nucleic acid with a restriction endonuclease specific for the unique restriction endo- 
nuclease site; and 

h) identifying the target nucleotide. 

35. The method of claim 34, wherein identifying the target nucleotide is indicative of a mutation in the nucleic acid 
sequence. 

36. The method of claim 34, wherein the target nucleotide is identified based on the mass of the extended nucleic acid. 

37. The method of claim 35, wherein the mass of the extended nucleic acid is determined by mass spectrometry. 

38. A method for detecting a target nucleic acid in a biological sample using RNA amplification, comprising: 

a) amplifying a target nucleic acid using a primer comprising a sequence that is complementary to the target 
sequence, or a complement thereof, and a sequence that encodes an RNA polymerase promoter; 

b) synthesizing RNA using an RNA polymerase that recognizes the promoter; and 

c) detecting the resulting RNA using mass spectrometry, thereby detecting the presence of the target nucleic 
acid sequence in the biological sample. 

39. A process of detecting the presence of a target nucleic acid sequence, comprising the steps of: 

a) incubating in a reaction mixture 

i) an RNA polymerase,

ii) nucleoside triphosphates, and

iii) a nucleic acid molecule comprising the target nucleic acid sequence, or a complement thereof, and a
promoter for the RNA polymerase, 

thereby producing an RNA molecule comprising the target nucleic acid sequence, or a complement 
thereof; and 

b) detecting the RNA molecule by mass spectrometry, thereby detecting the presence of the target nucleic 
acid sequence. 

40. The process of claim 39, wherein the nucleic acid molecule comprising the target nucleic acid sequence is DNA 
and the RNA polymerase is a DNA dependent RNA polymerase. 

41. The process of claim 40, further comprising, following step a) and prior to step b), 

inactivating the RNA polymerase; and 
digesting the DNA using RNAse-free DNAse I.


42. The process of claim 39, further comprising:


hybridizing a detector oligonucleotide to the RNA molecule comprising the target nucleic acid sequence, 
wherein the detector oligonucleotide is complementary to a portion of the target nucleic acid sequence; and 
removing unhybridized detector oligonucleotide; 


wherein, in step b), the RNA molecule comprising the target nucleic acid sequence is detected by detecting 
hybridized detector oligonucleotide. 


43. A method for detecting the presence of a target nucleic acid sequence, comprising: 


a) amplifying the target nucleic acid sequence using a primer comprising a sequence that is complementary 
to at least a portion of the target sequence, or a complement thereof, and a sequence that encodes an RNA 
polymerase promoter, thereby producing an amplified nucleic acid molecule comprising the target nucleic acid,
or complement thereof, and an RNA polymerase promoter;

b) incubating the amplified nucleic acid molecule with nucleoside triphosphates and an RNA polymerase that 
recognizes the promoter, thereby producing an RNA corresponding to the target nucleic acid sequence; and 

c) detecting the RNA using mass spectrometry, thereby detecting the presence of the target nucleic acid sequence.

44. A process for detecting a target nucleic acid sequence present in a biological sample, comprising the steps of:

a) obtaining a nucleic acid molecule containing a target nucleic acid sequence from a biological sample; 
b) immobilizing the target nucleic acid sequence on a solid support via thiol linkages, whereby the target nucleic
acid sequence is present at a sufficient density to detect it using mass spectrometry;

c) hybridizing a detector oligonucleotide with the target nucleic acid sequence; 

d) removing unhybridized detector oligonucleotide; 

e) ionizing and volatizing the product of step c); and

f) detecting the detector oligonucleotide by mass spectrometry,

wherein detection of the detector oligonucleotide indicates the presence of the target nucleic acid sequence in the 
biological sample. 

45. The process of claim 44, wherein the target nucleic acid sequence is amplified prior to immobilization.

46. The process of claim 44 or 45, wherein at least one of the detector oligonucleotide or the target nucleic acid 
sequence has been conditioned. 

47. A process of any of claims 44-46, wherein the solid support is selected from the group consisting of: beads, flat
surfaces, pins and combs. 

48. A process of any of claims 44-47, wherein the target nucleic acid is immobilized in the form of an array.

49. A process of any of claims 44-48, wherein the support is a silicon wafer.

50. A process of any of claims 44-49, wherein the target nucleic acid sequence is amplified by an amplification pro- 
cedure selected from the group consisting of cloning, transcription, the polymerase chain reaction, the ligase chain 
reaction, and strand displacement amplification. 

51. A process of any of claims 44-50, wherein the mass spectrometer is selected from the group consisting of: Matrix-
Assisted Laser Desorption/Ionization Time-of-Flight, Electrospray, Ion Cyclotron Resonance, and Fourier Transform.

52. A process of any of claims 44-51, wherein the sample is conditioned by mass differentiating at least two detector
oligonucleotides or oligonucleotide mimetics to detect and distinguish at least two target nucleic acid sequences 
simultaneously. 

53. A process of claim 52, wherein the mass differentiation is achieved by differences in the length or sequence of the
at least two oligonucleotides.

54. A process of claim 53, wherein the mass differentiation is achieved by the introduction of mass modifying func- 
tionalities in the base, sugar or phosphate moiety of the detector oligonucleotides. 

55. A process of claim 52, wherein the mass differentiation is achieved by exchange of cations at the phosphodiester
bond. 
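
The mass differentiation described in claims 52 and 53 can be illustrated by computing the masses of candidate detector oligonucleotides and checking that every pair is separated by more than a chosen margin. The Python sketch below is a hedged example. The residue masses and the terminal correction follow the convention commonly used by oligonucleotide mass calculators for unmodified DNA with 5'-OH and 3'-OH ends and are assumed here; the sequences, the separation threshold and the function names are hypothetical.

```python
from itertools import combinations

# Approximate average masses (Da) of nucleotide residues within a DNA chain.
# Assumed textbook values, not figures stated in the claims.
RESIDUE_MASS = {"A": 313.21, "C": 289.18, "G": 329.21, "T": 304.20}

# Correction for an unmodified oligonucleotide with 5'-OH and 3'-OH termini.
TERMINAL_CORRECTION = -61.96


def oligo_mass(sequence):
    """Approximate average mass (Da) of a single-stranded DNA oligonucleotide."""
    return sum(RESIDUE_MASS[base] for base in sequence) + TERMINAL_CORRECTION


def resolvable(detectors, min_separation):
    """True if every pair of detectors differs by at least min_separation Da."""
    masses = [oligo_mass(seq) for seq in detectors]
    return all(abs(a - b) >= min_separation for a, b in combinations(masses, 2))


# Hypothetical detectors differing in length are well separated in mass,
# while two detectors of identical base composition are not.
print(resolvable(["ACGTACGT", "ACGTACGTAC"], 100.0))
print(resolvable(["ACGT", "TGCA"], 1.0))
```

The second case shows why a difference in sequence alone is not always sufficient: oligonucleotides with the same base composition have the same mass, so a length difference or a mass-modifying functionality is then needed to tell them apart.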

56. A process of any of claims 44-55, wherein the nucleic acid molecule obtained from a biological sample is amplified 
into DNA using mass modified dideoxynucleoside triphosphates and DNA dependent DNA polymerase prior to
mass spectrometric detection.

57. A process of any of claims 44-56, wherein the nucleic acid molecule obtained from a biological sample is amplified 
into RNA using mass modified ribonucleoside triphosphates and DNA dependent RNA polymerase prior to mass
spectrometric detection.

58. A process of any of claims 44-57, wherein the target nucleic acid sequence is indicative of a disease or condition
selected from the group consisting of a genetic disease, a chromosomal abnormality, a genetic predisposition, a
viral infection, a fungal infection and a bacterial infection.

59. A process of any of claims 39-58, wherein the detector oligonucleotide is a peptide nucleic acid. 

60. A primer, comprising at least about 20, preferably about 16, bases of any of the sequence of nucleotides sequences 
set forth in SEQ ID NOs. 32-38, 41-86, 89, 92, 95, 98, 101-110, 112-123, 126, 128 and 129.

61. A primer, comprising at least about 20, preferably about 16, bases of any of the sequence of nucleotides sequences 
set forth in SEQ ID NOs. 1-22, 24, and 27-32. 

62. A primer of claim 60 or 61 that is unlabeled, and optionally includes a mass modifying moiety, which is preferably
attached to the 5' end.

63. The method of any of claims 1-17 and the process of any one of claims 18-59, wherein a nucleic acid is immobilized
to a solid support via a selectively cleavable linker.

64. The method or the process of claim 63, wherein the linker is thermocleavable, enzymatically cleavable, photo- 
cleavable or chemically cleavable. 

65. The method or the process of claim 63, wherein the linker is a trityl linker.

66. The method or the process of claim 65, wherein the linker is selected from the group consisting of 1-(2-nitro-5-
(3-O-4,4'-dimethoxytritylpropoxy)phenyl)-1-O-((2-cyanoethoxy)-diisopropylaminophosphino)ethane and 1-(4-
(3-O-4,4'-dimethoxytritylpropoxy)-3-methoxy-6-nitrophenyl)-1-O-((2-cyanoethoxy)-diisopropylaminophosphino)
ethane.

67. The method or process of any of claims 1-59 and 63-66 wherein a primer is a peptide nucleic acid. 

68. The primer of claims 60-62 that is a peptide nucleic acid. 
[Figure: PARTIAL SEQUENCE OF THE β-GLOBIN TEMPLATE]
[Figure: SEQUENCE OF THE AMPLIFIED 209 bp PCR PRODUCT OF THE β-GLOBIN GENE, with the forward primer, the 12-mer primer and the reverse primer marked]
[Figure: PRIMER TTGTCCCTAAACCCCTT (4443.0) and its products for the T5, T7 (7515.0) and T9 (8139.4) alleles]